GOLOD-SHAFAREVICH GROUPS: A SURVEY 



MIKHAIL ERSHOV 



Abstract. In this paper we survey the main results about Golod-Shafarevich groups and 
their applications in algebra, number theory and topology. 
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1. Introduction 

1.1. The discovery of Golod-Shafarevich groups. Golod-Shafarevich groups have been 
introduced (or rather discovered) in connection with the famous class field tower problem, 
which asks whether the class field tower of any number field is finite. This classical number- 
theoretic problem, posed by Furtwangler in 1925, remained open for almost 40 years, with no 
clear indication whether the answer should be positive or negative. By class field theory, the 
problem is equivalent to the non-existence of a number field K whose maximal unramified 
prosolvable extension has infinite degree (over K). A convenient way to construct K with 
the latter property (and thus settle the class field tower problem in the negative) would be to 
show that for some prime p the maximal unramified p-extension K p of K has infinite degree, 
or equivalently, the Galois group Gk, p = Gal(K p / K) is infinite (note that Gk, p is a pro-p 
group, so if finite, it must be a p-group). 

A major evidence for the negative answer to the class field tower problem was given by 
the 1963 paper of Shafarevich [Sh], where the formula for the minimal number of generators 
d(GK, P ) of Gk, p and an upper bound for the minimal number of relations t(Gk,p) were 
established. These results implied that for any prime p, there exists an infinite sequence 
of number fields {K(n)} such that if G n = Gjc( n ) iP , then d(G n ) — > oo as n — > oo and 
r{G n ) — d(G n ) remains bounded. Shafarevich conjectured that there cannot be any sequence 
of finite p-groups with these two properties (which would imply that in the above sequence 
G n must be infinite for sufficiently large n). A year later, in 1964, Golod and Shafarevich [GS| 
confirmed this conjecture by showing that for any finite p- group G the minimal numbers of 
generators d(G) and relators r(G) (where G is considered as a pro-p group) are related by 
the inequality r(G) > (d(G) - l) 2 /4 (this was improved to r(G) > d(G) 2 /4 in the subsequent 
works of Vinberg jVi] and Roquette |Ro| ) . 
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1.2. Golod-Shafarevich inequality. The algebraic tool used to prove that r(G) > d(G) 2 /A 
for a finite p-group is the so called Golod-Shafarevich inequality. It can be formulated in 
many different categories, including graded (associative) algebras, complete filtered algebras 
(algebras defined as quotients of algebras of power series K((u±, . . . ,Ud)) in non-commuting 
variables ui, . . . ,Ud), abstract groups and pio-p groups, and relates certain growth function 
of an object in one of these categories with certain data coming from the presentation of that 
object by generators and relators. The main consequence of the Golod-Shafarevich inequality 
is that if the set of relators defining an object is "small" (in certain weighted sense) compared 
to the number of generators, then the object must be infinite in the case of groups and infinite- 
dimensional in the case of algebras. Groups and algebras which admit a presentation with 
such a "small" set of relators are called Golod-Shafarevich. 

A well-known consequence of the Golod-Shafarevich inequality (which is sufficient for the 
solution of the class field tower problem) is that a pro-p group G such that r(G) < d(G) 2 /4 
must be infinite - this is an example of what it means for the set of relators to be "small" . 
However, as we already mentioned, the relators are counted with suitable weights, so even an 
infinite set of relators can be "small". In particular, it is easy to see that there exist Golod- 
Shafarevich abstract groups which are torsion. This result was established by Golod [Gol] 
and yielded the first examples of infinite finitely generated torsion groups, thereby settling 
in the negative the general Burnside problem. This is the second major application of the 
Golod-Shafarevich inequality. 

1.3. Applications in topology. The majority of works on Golod-Shafarevich groups in 70s 
and early 80s dealt with variations and generalizations of the inequality r(G) > d(G) 2 /4 both 
in group-theoretic and number-theoretic contexts, but no really new applications of Golod- 
Shafarevich groups were discovered. In 1983, Lubotzky [Lul| made a very important observa- 
tion that the fundamental groups of (finite- volume orientable) hyperbolic 3-manifolds (which 
can be equivalently thought of as torsion-free lattices in SL2 (C)) are Golod-Shafarevich up 
to finite index. Using this result, in the same paper Lubotzky solved a major open problem, 
known at the time as Serre's conjecture, which asserts that arithmetic lattices in SL%(C) 
cannot have the congruence subgroup property. Lubotzky's proof was highly original, and 
even though Golod-Shafarevich techniques constituted a relatively small (and technically not 
the hardest) part of the argument, it gave hope that other, possibly more difficult, problems 
about 3-manifolds could be settled with the use of Golod-Shafarevich groups. This line of 
research turned out to be quite successful, and even though no breakthroughs of the magni- 
tude of the proof of Serre's conjecture had been made, several important new results about 
hyperbolic 3-manifold groups had been discovered, including very strong lower bounds on the 
subgroup growth of such groups by Lackenby [Lai} ILa2j . Equally importantly, the potential 
applications in topology served as an extra motivation for developing the general structure 
theory of Golod-Shafarevich groups, and many interesting (and useful for other purposes) 
results in that area were obtained in the past few years. 

1.4. General structure theory of Golod-Shafarevich groups. The initial applications 
in the works of Golod and Shafarevich [GS[ IGol] only required a sufficient condition for a 
group given by generators and relators to be infinite. However, the groups satisfying that 
condition (Golod-Shafarevich groups) turn out to be not only infinite - they are in fact big 
in many different ways. Already the arguments in the original paper [GSJ show that for any 
Golod-Shafarevich group G with respect to a prime p, the graded algebra associated to its 
group algebra F p [G] has exponential growth. Combining this result with Lazard's criterion, 
Lubotzky observed that Golod-Shafarevich pio-p groups are not p-adic analytic - this was 
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a key observation in the proof of Serre's conjecture in |Lul] . In [Wil], Wilson proved that 
every Golod-Shafarevich groups has an infinite torsion quotient, using a simple modification 
of Golod's argument [Golj . Two deeper results which required essentially new ideas had 
been established more recently. In [Zel] . Zelmanov proved a remarkable theorem asserting 
that every Golod-Shafarevich pro-p group contains a non-abelian free pvo-p subgroup. This 
result, clearly very interesting from a purely group-theoretic point of view, is also important 
for number theory since many Galois groups Gk, p discussed above are known to be Golod- 
Shafarevich, and the fact that these groups have free pro-p subgroups conjecturally implies 
that they do not have faithful linear representations over pro-p rings. Very recently, in 
[EJ2], it was shown that every Golod-Shafarevich abstract group has an infinite quotient 
with Kazhdan's property (T), which implies that Golod-Shafarevich abstract groups cannot 
be amenable. The proof in |EJ2j was based, among other things, on an earlier work [Erl| . 
which established the existence of Golod-Shafarevich groups with property (T). The latter 
result, originally obtained as a counterexample to a conjecture of Zelmanov |Ze2| . turned out 
to have many other applications in geometric group theory. 

1.5. Counterexamples in group theory. As we already mentioned, Golod-Shafarevich 
groups gave the first counterexamples to the general Burnside problem, which remained 
open for 60 years. Just a few years later, Novikov and Adyan [NA| gave a very long and 
technical proof of the fact that free Burnside groups of sufficiently large odd exponent are 
infinite, thereby providing the first examples of infinite finitely generated groups of bounded 
exponent (and thus solving THE Burnside problem) . Another construction of infinite finitely 
generated torsion groups, very different from [GSJ and [NA| . was given by Grigorchuk |Gr| 
- these were also the first examples of groups of intermediate word growth. In addition, in 
the 80's, powerful methods had been developed to produce various kinds of infinite torsion 
groups with extremely unusual finiteness properties, starting with Ol'shanskii examples of 
Tarski monsters [Oil] and continuing with even wilder examples constructed using the theory 
of hyperbolic and relatively hyperbolic groups (see, e.g.. [013|. IQslj ). In view of this, Golod- 
Shafarevich groups had been somewhat overshadowed as a potential source of exotic examples. 
However, in the last few years Golod-Shafarevich groups reappeared in this context and were 
used to solve several interesting problems where generally more powerful techniques from the 
area of hyperbolic groups are not applicable. For instance, the existence of Golod-Shafarevich 
groups with property (T) yielded the first examples of torsion non-amenable residually finite 
groups |Erl| . In [E J3] . Golod-Shafarevich groups were used to produce the first residually 
finite analogues of Tarski monsters. In § El we will discuss several other applications of this 
kind as well as a very general technique for discovering new such results. 

1.6. Generalizations, relatives and variations of Golod-Shafarevich groups. A lot 

of attention in this paper will be devoted to generalized Golod-Shafarevich groups abbreviated 
as GGS groups (Golod-Shafarevich groups will be abbreviated as GS groups). Generalized 
Golod-Shafarevich groups are defined in the same way as Golod-Shafarevich groups except 
that generators are allowed to be counted with different weights. They have been introduced 
(without any proper name attached) shortly after Golod-Shafarevich groups (for instance, 
they already appear in Koch's book [Kol| first published in 1970), and it is easy to extend 
all the basic properties of GS groups to GGS groups. However, GGS groups have not been 
used much until recently, when it became clear that the class of GGS groups is more natural 
in many ways than that of GS groups. In particular, GGS groups played a key role in the 
construction of Kazhdan quotients of GS groups [EJ2| and residually finite analogues of Tarski 
monsters [E J3| . 
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Another interesting class of groups which strongly resemble Golod-Shafarevich groups both 
in their definition and their structure was introduced very recently by Schlage-Puchta [SP] 
and independently by Osin [Os2j. In the terminology of [SP], these are groups of positive 
p-deficiency (we will use the term power p- deficiency instead of p-deficiency to avoid termino- 
logical confusion). Perhaps, the most striking fact about these groups is that they provide by 
far the most elementary solution to the general Burnside problem (discovered almost 50 years 
after the initial counterexamples by Golod). These groups will be discussed and compared 
with Golod-Shafarevich groups in § [9j 

Finally, we should make a remark about the term 'Golod-Shafarevich groups'. Even though 
these groups have been studied for almost 50 years, there seemed to be no consensus on what 
a 'Golod-Shafarevich group' should mean until the last few years, and it was common for this 
term to have a more restricted meaning (say, pro-p groups for which r(G) < d(G) 2 /4 and 
d(G) > 1) or even the opposite meaning (that is, groups which are not Golod-Shafarevich in 
our terminology). It is also common, especially in older papers, to talk about Golod groups 
- these usually refer to the class of p-torsion groups constructed by Golod in [Golj . These 
groups are defined as certain subgroups of Golod-Shafarevich graded algebras, but it is not 
clear whether the groups themselves are Golod-Shafarevich since they are not defined directly 
by generators and relators. Thus, a general theorem about Golod-Shafarevich groups does 
not formally apply to Golod groups, but in most cases the corresponding result for Golod 
groups can be obtained by using essentially the same argument. 

Acknowledgments: The author is grateful to Andrei Jaikin for useful discussions and 
suggestions and to Ashley Rail for carefully reading the paper, providing useful feedback 
and proposing Problem [5] in § [T4l The author would also like to thank Mark Sapir and the 
anonymous referee for helpful suggestions. 

2. Golod-Shafarevich inequality 

2.1. Golod-Shafarevich inequality for graded algebras. Let K be a field, U = {u\ , . . . , Ud] 

a finite set, and denote by K{U) = K(u\, . . . ,Ud) the free associative -fT-algebra on U, that 
is, the algebra of polynomials in non-commuting variables U\,... ,Ud with coefficients in K. 
Let K(U) n be the degree n homogeneous component of K(U), so that 

K(U) = (B™ =0 K(U) n . 

Let R be a subset of K(U) consisting of homogeneous elements of positive degree, and let 
A be the K- algebra given by the presentation (U,R). This means that 

A = K(U)/I, 

where / is the ideal of K(U) generated by R. 

Note that / a graded ideal, that is, / = @I n where I n = I H K(U) n , and A is a graded 
algebra: A = ®A n where A n = K(U) n /I n . Let a n = dim^- A n . 

For each n £ N let r n be the number of elements of R which have degree n. Since K{U) n 
is a finite-dimensional subspace, we can (and will) assume without loss of generality that 
r n < oo for each n G N. 

The sequences {a n } and {r n } are conveniently encoded by the corresponding Hilbert series 
HilbA(t) = ^2^=0 CL n t n and Hn(t) = Y^=i r nt n - The following inequality relating these two 
series was established by Golod and Shafarevich |GSj . 

Given two power series f(t) = ^ f n t n and g(t) = ^ g n t n in R[[i]], we shall write f(t) > g(t) 
if fn > 9n for each n. 



GOLOD-SHAFAREVICH GROUPS: A SURVEY 



6 



Theorem 2.1 (Golod-Shafarevich inequality: graded case). In the above setting we have 

(2.1) (1 - |l/|t + H R (t)) ■ Hilb A (t) > 1. 

Proof. Even though the proof of this result appears in several survey articles and books (see, 
e.g., (Hal Section 5]), we present it here as well due to its elegance and importance. 

Let R n = {r G R : deg (r) = n}, so that R = U n >ii? n . Recall that / is the ideal of K(U) 
generated by R and I = ®^ =l I n with I n C K(U) n . 

Now fix n > 1. Since each r G R is homogeneous, I n is spanned over K by elements of the 
form vrw for some v G K{U) s ,w G K{U)t and r G i? m , where s + m + t = n and v and w 
are monic monomials. 

If v ^ 1, then v = tit/ for some u G U, so vrw = uv'rw G u/ n _i. If v = 1, then 
uru> = rw £ R m K(U) n - m . Hence 

n 

J n = span^([/)/ n _i + ^ span E -(i2 m )ir([/) n _ m . (* * *) 

m=l 

For each i G Z>o choose a -ftT-subspace i?i of K(U)i such that K(U)i = Ii ® Bi. Then 
span ftr (i? m )i ; r(C/) 

n—m — span^- (_R m ).B ra _ m -|-span^- (R m ^I n — m and span^-(^? m )/ ra _ m C span E -(C/)/ n _i. 
Combining this observation with (***), we conclude 

n 

(2.2) I n = span K (U)I n -i + ^ sp an^- ( R m ) B n _ m . 

m=l 

Let ci = |J7|. Since Ai = K(U)i/Ii, we have = dim^j = K(U)i — dim/j = d % — dim/j, and 
thus dimSj = a^. Hence, computing the dimensions of both sides of (|2.2p . we get 

n 

d n - a n < d(d n ^ 1 - a n _i) + ^ r m a n _ m , 

m=l 

which simplifies to a n — <ia n _i + Yl m =i r m a n-m > 0. 

Finally observe that a n — da n -\ + Ylm=l r ma n -m is the coefficient of t n in the power series 
(1 — dt + Hfi(t)) ■ HilbA{t). The constant term of this power series is oq = 1. Therefore, we 
proved that (1 — dt + Hji(t)) ■ HilbA(t) > 1 + Ylm=i • = 1 as power series, as desired. □ 

As an immediate consequence of the Golod-Shafarevich inequality one obtains a sufficient 
condition for a graded algebra A given by a presentation (U, R) to be infinite-dimensional: 

Corollary 2.2. Assume that there exists a real number r > s.t. 1 — dr + Hr(t) < (in 
particular, we assume that the series Hr(t) converges). Then 

(i) The series Ha(t) diverges. 

(ii) Assume in addition that r G (0, 1) and 1 — dr + Hr(t) < 0. Then the algebra A 
has exponential growth, that is, the sequence a n = dim^4 n grows exponentially. In 
particular, A is infinite- dimensional. 

Proof, (i) Suppose that the series Ha(t) converges. Then, if we substitute t = r in (|2.ip . both 
factors on the left-hand side of (|2.ip become convergent, so we should get a valid numerical 
inequality Ha(t)(1 — dr + Hr(t)) > 1. This cannot happen since clearly Ha(t) > 0, while 
by assumption 1 — dr + Hji(t) < 0. 

(ii) Since the series Ha{t) diverges, we must have limsup ^fd~^ > ~ > 1. On the other hand, 
since by construction A is generated in degree 1, the sequence {a n } is submultiplicative (that 
is, a n + m < a n a m for all n, m), which implies that lim ^/a^ exists. Therefore, lim ^fa^ > 1, 
so {a n } grows exponentially. □ 
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Corollary 2.3. Let A be a finite- dimensional graded K -algebra, and let (U, R) be a graded 
presentation of A, which is minimal in the sense that no proper subset of U generates A. 
Then 

(2.3) \R\ > \U\ 2 /A. 

Proof. We first note that r\ = 0, that is, R has no relators of degree 1, since any such relator 
would allow us to express one of the generators in U as a linear combination of the others, 
contradicting the minimality of the presentation (U,R). Therefore, for any r > we have 
1 - \U\t + H r {t) < 1 - \U\t + \R\t 2 . 

On the other hand, since A is finite-dimensional, by Corollary 12.21 for any r > we have 
1 - \U\t + H r (t) > 0. Thus, 1 - \U\t + |i?|r 2 > for any r > 0, and setting r = 2/\U\, we 
obtain \R\ > \U\ 2 /4. □ 

Remark: Minimality of U is actually equivalent to the assumption r\ = 0. Note that it is 
necessary to make this assumption in Corollary 12.31 - without it we could start with any finite 
presentation for A and then add any set of "artificial" generators S together with relations 
s = for each s 6 S, violating (j2.3j) for sufficiently large S. 

It is a very interesting question whether inequality (\2.3\i is optimal. More generally, fix a 
field K, and given an integer d > 2, let f(d) be the smallest positive integer for which there 
exists a subset R of K(u%, ■ ■ ■ ,Ud) consisting of homogeneous elements of degree at least 2, 
with \R\ = f(d), such that the if-algebra {u%, . . . , \ R) is finite-dimensionalQ 

Corollary 12.31 implies that f{d) > d 2 /4, and there is an obvious upper bound f{d) < 
(d 2 + d) 1 2 yielded by any commutative algebra in which each Uj is nilpotent. That this upper 
bound is not optimal was immediately realized by Kostrikin [Kos] in 1965 who showed that 
f(d) is at most (d 2 — l)/3 + d, at least when d is a power of 2. In 1990, Wisliceny [Wis2] 
found a better upper bound which asymptotically coincides with the Golod-Shafarevich lower 
bound: f(d) < ^ + 1 if d is even and f{d) < + 1 if d is odd. The final improvement was 

obtained very recently by Iyudu and Shkarin |IySh2| who proved that f(d) <] d [ (where 
]x[ is the smallest integer greater than or equal to x). Examples of presentations yielding 
this bound (as well as examples from |Wis2| ) are of very simple form, with every relation of 
the form UiUj = u^ui for some indices i,j,k and /. Algebras given by presentations of this 
form are called quadratic semigroup algebras in |IySh2], where it is proved that the bound 
fid) <] d2 +d+ l [ j s optimal for this class of associative algebras. 

Another natural problem is when the Golod-Shafarevich inequality (|2.ip becomes an equal- 
ity. Anick showed that this happens if and only if the set of relators R is strongly free [Anil 
Theorem 2.6], and if we assume that R is a minimal set of relators, then R is strongly free if 
and only if the algebra A = (U\R) has global dimension < 2 (see |AnH Theorem 2.12]); see 
also [Pij for some related results. An easily verifiable sufficient condition for R to be strongly 
free is that the set of (suitably defined) leading terms of elements of R is combinatorially free 
(see [Anil Theorem 3.2]). Stronger results on this problem have been obtained for the class of 
quadratic algebras, that is, algebras given by a presentation with all relations homogeneous 
of degree two - see [An2t IyShl| and the books [Uf] and [PPJ. 



^It is not known whether the function /(d) depends on K. 

2 The paper [Kosj gives a construction of finite p-groups with the corresponding bound on the number of 
relators, but its easy modification yields the analogous result for associative algebras. 
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2.2. First applications of the Golod-Shafarevich inequality. We now discuss two ma- 
jor applications of the Golod-Shafarevich inequality - the (negative) solutions to the Kurosh- 
Levitzky and the general Burnside problems. 

Problem (Kurosh-Levitzky). Let K be a field. Is it true that a finitely generated nil algebra 
over K must be finite- dimensional (and hence nilpotent)? 

Theorem 2.4 (Golod, [Goll lGo2]). Let K be a field and d > 2 an integer. Then there exists 
a d-generated associative nil algebra over K which is infinite- dimensional. 

Proof. We start with the case when K is countable established in [Gol| . where the argument 
is very simple. Let U = {u\, . . . ,Ud} and K(U) + the subset of K{U) consisting of all poly- 
nomials with zero constant term. Since K is countable, K{U} + is also countable, so we can 
enumerate its elements: K(U) + = {/i, /2, • • •}• 

Let r G (1/d, 1) and JVgN be such that l-dr+Y^n=N r ™ < °- Choose Ni > N, and write 
the element Z^ 1 as the sum of its homogeneous components: Z^ 1 = X^=i/i,i- Note that 
deg (/i,t) > Ni since f\ has zero constant term. Next choose N2 > max{A r i + 1, {deg (/i,i)}}, 
and let {f2,i}i=i be the homogeneous components of f^ 2 - Next choose N3 > max{iV2 + 
1, {deg (/2,i)}} an d proceed indefinitely. 

Let R = {f n ,i ■ n G N, 1 < i < k n }, consider the algebra A = (U\R), and let A + be 
the image of K(U) + in A. By construction, the algebra A + is (i-generated and nil and has 
codimension 1 in A, so we only need to prove that A is infinite-dimensional. The choice of 
{Ni} ensures that the set of relations R contains at most one element of each degree and no 
elements of degree less than N. Therefore, 1 — \ U\r + Hr{t) < 1 — dr + Y1^=n < 0, so vl 
is infinite-dimensional by Corollary 12.21 

General case: Let be the set of all non-empty finite sets of monic [/-monomials of positive 
degree (this is clearly a countable set). Given w G 0, let S u be the set of all elements of 
K{U) + which are representable as a iT-linear combination of elements of to and u is the 
smallest set with this property. Thus, K(U) + = U^^nS^. 

Now fix uj = {mi,...,m„} G £1. Every element / G S w can be written as a sum 
/ = c i m i where Cj's are elements of K, which for the moment we treat as commut- 

ing formal variables. Given N G N, we have f N = Ylt=i Mi,w,Jv(ci, • • • , Cn)Pi,u>,N where each 
Mi,cj,iv(ci, • • • , c n ) is a monic monomial in c%, . . . , c n of degree N, djv> is the number of distinct 
such monomials, and each Pi tUJ) N is a polynomial in U with no terms of degree < N. Here 
are two key observations: 

(i) Fix oj G O and A'dN. If a set R contains all homogeneous components of Pi tU j,N for 
each 1 < i < djy tU1 , then the image of every element of in the algebra A = (U\R) 
will be nilpotent. 

(ii) The sequence djv,w grows polynomially in N (as uj stays fixed) since Cj's commute 
with each other. 

Now fix r G (1/d, 1), for each uj £ Q choose N u G N, and let R be the set of all (nonzero) 
homogeneous components of the polynomials Pi jW ,Ar w with uj G £1 and 1 < % < djv w ,cj- Let 
A = (U\R) and A + the image of K(U) + in A. 

Property (i) ensures that A + is nil. Since homogeneous components of Pi tUJt N u have degree 
> N u , we have 

h r (t) < 22 d N„,Lo 22 t3 = 2_/ 

wesi j=N bJ uien 
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Property (ii) ensures that each term of this sum can be made arbitrarily small by choosing 
sufficiently large N u (independently of other terms). In particular, we can make sure that 
1 — dr + Hr(t) < 0, so that A is infinite-dimensional, and we are done as in the case of 
countable K. □ 

Once Theorem 12.41 is proved, it is very easy to show that the general Burnside problem 
also has negative solution. 

Theorem 2.5 (Golod, |Gol| ). For every prime p and integer d > 2 there exists an infinite 
d-generated p-torsion group. 

Proof. Let K = ¥ p , the finite field of order p, and let A + be the algebra constructed in 
the proof of Theorem 12.41 Since A + is nil and K has characteristic p, the set 1 + A + = 
{1 + a : a G A + } is a p-torsion group. We claim that the subgroup T of 1 + A + generated by 
1 + «!,..., 1 + Ud is infinite (and hence satisfies all the required properties). The inclusion 
map i : T — > 1 + A + induces a homomorphism : F p [T] — > A (where F p [r] is the F p -group 
algebra of T). The image of contains 1 and 1 + Ui for each i (hence also Ui for each i), so 
is surjective. Since A is infinite-dimensional, so must be F p [r], whence T is infinite. □ 

Note that while the above construction of infinite finitely generated p-torsion groups uses 
very elementary tools, it has one disadvantage as we have no control over presentations of 
r by generators and relators. This problem will be addressed in the next section where we 
will provide an alternative version of Golod's construction (which uses Golod-Shafarevich 
groups), based on a more general form of the Golod-Shafarevich inequality. 

2.3. Golod-Shafarevich inequality for complete filtered algebras. In order to de- 
fine and study Golod-Shafarevich groups, one needs a more general version of the Golod- 
Shafarevich inequality dealing with complete filtered algebras. Below we shall essentially 
repeat the setup of § 12.11 with two changes: polynomials are replaced by power series and 
relators are allowed to be non- homogeneous. 

As in § 12.11 we fix a finite set U = {u±, . . . , u^}, a field K, and let K((U}) = K((u±, . . . , uj)} 
be the algebra of power series over K in non-commuting variables u\, . . . ,Ud- As usual, given 
/ G K((U)), we define deg (/) to be the smallest length of a monomial in U which appears 
in / with nonzero coefficient. For convenience we also set deg (0) = oo. Let K{{U)) n = {/ G 
K((U)) : deg (/) > n}. The sets {K((U)) n } n ^ form a base of neighborhoods of for the 
natural degree topology on K((U)). 

Let 7 be a closed ideal of K((U))i, let R C / be a subset which generates I as a (closed) 
ideal, and let r n = \{r G R : deg (r) = n}\. As before, without loss of generality, we can 
assume that r n < oo for each n. 

Let A = K{(U))/I. Note that A is no longer a graded algebra, but it has a natural 
descending filtration {A n } n >o where A n = ir(K((U)) n ) and ir : K((U)) — > A is the natural 
projection. Since A is also complete with respect to topology determined by the filtration 
we will refer to such algebras as complete filtered algebras. 

Define a n = dim^- A n /A n+ i, and as in the graded case consider the Hilbert series HilbA(t) = 

Zn=0 Vnt n and H R (t) = Zn=l ^ ■ 

Theorem 2.6 (Golod-Shafarevich inequality: general case). In the above setting we have 



(2.4) 



(l-\U\t + H R {t))-Hilb A {t) > _L_ 
l-t ~ 1-t 
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Inequality (|2.4p was first proved by Vinberg [Vi] (see inequality (12) on p. 212). @ Its proof 
is similar to (but more technical than) the one in the graded case. This time it is not possible 
to give a good lower bound for a n , but one can still give a bound for dim A/A n+ \ = clq+. . .+a n 
which is the coefficient of t n in the power series _ 

Also note that inequality (|2.4|) follows from the one we had in the graded case by multi- 
plying both sides by the power series with positive coefficients. While inequality (12, 4h is 
weaker than (|2.ip . all the key consequences of (|2.ip established earlier in this section remain 
true in this setting: 

Proposition 2.7. Let A be a complete filtered algebra given by a presentation (U,R). 

(a) Assume that there exists a real number r G (0, 1) s.t. 1 — \ U\t + Hr(t) < 0. Then A 
is infinite- dimensional and HilbA(T) diverges. If 1 — | U\ t + Hr(t) < 0, the sequence 
{a n } (defined above) has exponential growth. 

(b) If A is finite- dimensional and U is minimal, then \R\ > |[/| 2 /4. 

Definition. 

(i) A presentation (U, R) in the category of complete filtered algebras will be said to 
satisfy the Golod-Shafarevich (GS) condition if 1 — \U\r + Hr{t) < for some t £ 
(0,1). 

(ii) A complete filtered algebra A will be called Golod-Shafarevich if it has a presentation 
satisfying the GS condition. 

3. Golod-Shafarevich groups 

3.1. Definition of Golod-Shafarevich groups. Fix a prime number p, and let G be a 

finitely generated pro-p group. Recall that G = ^m N( - Q ^ G/N, where £l p (G) is the set of 

open normal subgroups of G (all of which have p-power index). We shall be interested in 
the completed group algebra F p [[G]] which is defined as the corresponding inverse limit of 
Fp-group algebras: 

¥ p [[G}]= hm ¥ P [G/N]. 
Nen p (G) 

Suppose now that G is given in the form G = F/(R) F where F is a finitely generated free 
pro-p group, R is a subset of F (and (R) F is the closed normal subgroup of F generated by 
R). Then it is easy to see that there is a natural isomorphism F P [[G]] = F p [[F]]//r where Ir 
is the closed ideal of F p [[F]] generated by the set {r — 1 : r £ R}. 

Let X = {x\, . . . , Xd} be a free generating set of F. By a theorem of Lazard [Laz| . the com- 
pleted group algebra F P [[F]] is isomorphic to the algebra of power series ¥ p ((ui, . . . , Ud}} under 
the map Xi i- > 1 + U{. Note that this map yields an embedding of F into ¥ p ((ui, . . . ,Ud)} x , 
the multiplicative group of ¥ p ((ui, . . . ,Ud}}- This embedding is called the Magnus embedding 
(it was initially established by Magnus |Maj in the case of free abstract groups). 

The bottom line of the above discussion is that given a presentation (X, R) of a pro-p 
group G, there is a corresponding presentation for the completed group algebra F P [[G]] as 
a quotient of ¥ P ((U)) (with \U\ = \X\). A pro-p group G will be called Golod-Shafarevich 
if it has a presentation such that the corresponding presentation of F P [[G]] satisfies the GS 
condition. 

•^Formally, [Vi| deals with polynomials and not power series, but this makes no difference as explained in 
the paragraph just before Theorem 2 in [Vi| . Similarly, the extra assumption r 1 — made in [Vj is not 
essential for the proof. 
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Definition. Let X = {x\, . . . , Xd} and U = {u\, . . . , Ud} be finite sets of the same cardinality. 
Let F = Fp(X) the free prop group on X, and let i : F — > F P ((U)) x be the Magnus embedding. 
Define the degree function D:F->NU {00} by 

£>(/) = deg( t (/)-l), 

where deg is the usual degree of a power series in tti, . . . , Ud- 

Definition. 

(i) A (pro-p) presentation (X, R) is said to satisfy the Golod-Shafarevich ( GS) condition 
if there exists r G (0, 1) such that 1 - \X\t + H r (t) < where H R (t) = E reK t D(r) . 

(ii) A pro-p group G is called a Golod-Shafarevich ( GS) group if it has a presentation 
satisfying the GS condition. 

(iii) An abstract group G is called a Golod-Shafarevich group (with respect to p) if its 
pro-p completion Gp is Golod-Shafarevich. 

Remark: It is more common to call an abstract group G Golod-Shafarevich if it has an 
abstract presentation (X,R) s.t. 1 — \X\t + Hr(t) < for some r G (0, 1). This condition is 
certainly sufficient for G to be Golod-Shafarevich in our sense since if an abstract group G is 
given by a presentation (X, R), then its pro-p completion G^ is given by the same presentation 
(X, R), considered as a pro-p presentation (see, e.g., [Lul4 Lemma 2.1]). To the best of our 
knowledge, it is an open question whether these two definitions of Golod-Shafarevich abstract 
groups are equivalent. The advantage of our definition is that an abstract group G is Golod- 
Shafarevich if and only if the image of G in its pro-p completion is Golod-Shafarevich. 

Theorem 3.1 (Golod-Shafarevich). Golod-Shafarevich groups are infinite. 

Proof. If G is a Golod-Shafarevich pro-p group, then by construction F P [[G]] is a Golod- 
Shafarevich algebra, hence infinite. This implies that G has infinitely many open subgroups, 
so G must be infinite. If G is a Golod-Shafarevich abstract group, its pro-p completion is 
infinite, as we just argued, so G itself must be infinite. □ 

Before discussing applications of Golod-Shafarevich groups, we remark that the degree 
function D used above can also be described in terms of the Zassenhaus filtration which 
makes perfect sense in arbitrary (not just free) groups. 

Definition. Let G be a finitely generated abstract (resp. pro-p) group. Let M be the 
augmentation ideal of the group algebra ¥ p [G] (resp. completed group algebra F p [[G]]), that 
is, M is the ideal generated by the set {g — 1 : g € G}. For each n 6 N let D n G = {g G G : 
g — 1 £ M n }. The series {D n G} is called the Zassenhaus filtration of G. 

If F is a finitely generated free pro-p group and / G i ? \{l}, it is easy to see that D(f) = n 
if and only if / G D n F \ D n+ iF. In particular, this shows that the function D does not 
depend on the choice of a free generating set X of F. 

It is well known (see, e.g., [DDMSI Ch. 11,12]) that the terms of Zassenhaus filtration can 
also be defined as verbal subgroups: 

Proposition 3.2. The Zassenhaus filtration {D n G} can be alternatively defined by 

D n G= n ( 7l G)P\ 

i-pi>n 
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3.2. First applications of Golod-Shafarevich groups. As an immediate consequence of 
Theorem 13. 1\ we can construct infinite finitely generated torsion groups, which are explicitly 
given by generators and relators. The argument below is very similar to (and actually simpler 
than) the one used in the solution to the Kurosh-Levitzky problem over countable fields. To 
the best of our knowledge, this argument was first used by Wilson in [Wil]. 

Theorem 12.51 (Golod). For every prime p and integer d > 2 there exists an infinite d- 
generated p-torsion group. 

Second proof of Theorem \2.5[ Let X be any finite set with \X\ = d and F = F(X) the free 
group on X. Since F is countable, we can enumerate its elements F = {/i, /2, . . .}. Now take 
a sequence of integers n\ , , . . . , and consider a group G = (X\R) where R = { f\ 1 , f\ 2 , • • • } • 
By construction, G is p-torsion. 

k k 

It is easy to see that D{f p ) = D(f) p for any / G F, so for any r G (0, 1) we have 

oo 

1 - \X\t + H r (t) < 1 - \X\t + rpni ■ 

1=1 

Now fix r G (1/|X|, 1). Then 1 — \X\t < 0, so we can choose the sequence {n{\ such that 
tP < — (1 — |-X"|t). Then G will be Golod-Shafarevich and therefore infinite. □ 

The following stronger version of Theorem 13.2} also due to Golod, was announced in |Gol| 
and proved in |Go2| . 

Theorem 3.3 (Golod). For every prime p and integer d > 2 there exists an infinite d- 
generated p-torsion group in which every (d — l)-generated subgroup if finite. 

Remark: Theorem 13.31 was deduced in [Go2| from the corresponding result for graded 
algebras (which can also be found in the book by Kargapolov and Merzlyakov }KaMe| ) in 
essentially the same way Theorem 12.51 follows from Theorem 12.41 Below we give a "direct" 
group-theoretic proof of this result, generalizing the argument in the second proof of Theo- 
rem [53J 

Proof of Theorem \ 3. 31 Take any set X with \X\ = d, and construct a p-torsion Golod- 
Shafarevich group G = (X\R) as in the second proof of Theorem 12.51 with the extra re- 
quirement that 1 — \X\t + Hr{t) < for some r G (^, jpj). Let e = —(1 — \X\r + Hr(t)) 
and 5 = r(d — 1). Since 5 < 1, we can find an integer sequence {mi} such that YliLi < £ - 

Now let Cl = {a/ 1 ), uj^ 2 \ . . .} be the set of all ordered (d — l)-tuples of elements of F(X) 
(the free group on X) listed in some order. If cjW = (f^\ . . . , f^_i), let Ri be the set of all 
left-normed commutators of length nil involving /^ , . . . , f^}_ v 

We claim that the group G' = (X\RU Ui>i Ri) nas the required properties. By construc- 
tion, every (d— l)-generated subgroup of G' is nilpotent and p-torsion (since G' is a quotient 
of G) and hence finite. If c = [y±, . . . ,y m ] is a left-normed commutator of length m, then 
D(c) > D{ V1 ) + ... + D(y m ). Therefore, if S t = {(yx, . . . , y mi ) : Vj G {/f , . . . , /^J}, then 

d-l 

H Ri {t) < Yj t d ^ + - +d ^ < (J^r D(/ / < {T{d-l)) m * = 5 m \ 
(yi,-.,ym t )eSi j=l 

Hence 1 — \X\t + Hr(t) + ^«>i Hr^t) < 0, so G' is Golod-Shafarevich, hence infinite. □ 
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We now turn to the proof of the famous inequality \R\ > \X\ 2 /A for finite p-groups which 
was used in the solution of the class field tower problem. Recall that for a pro-p group G we 
denote by d(G) and r(G) the minimal number of generators and relators of G, respectively. 

Theorem 3.4. Let p be a prime. 

(a) Let G be a finitely presented pro-p group such that r(G) < d{G) 2 /A. Then G is 
infinite. If in addition r(G) < d(G) 2 /A and d(G) > 1, then G is Golod-Shafarevich. 

(b) LetT = (X\R) be a finitely presented abstract group, andletd p (T) = dim^ (r/[r, r]T p ). 
// \R\ < dpiT) 2 /4 — d p (T) + \X\ and d p (T) > 1, then T is Golod-Shafarevich (with 
respect to p). 

Lemma 3.5. Let (X,R) be a presentation of a pro-p group G, with X finite, F = F^(X) 
and 7r : F — > G the natural projection. The following hold: 

(i) |X| = d(G) if and only if R lies in &(F) = [F,F]F P , the Frattini subgroup of F 
(which holds if and only if D(r) > 2 for all r G R). Moreover, R contains at least 
\X\ — d{G) elements of degree 1. 

(ii) Assume that R is finite. Then G has a presentation with d{G) generators and \R\ — 
\X\ + d{G) relators. More precisely, there exists a subset X' of X, with \X'\ = d{G) 
and a subset R' of R with \R'\ = \R\ — \X\ + d(G) with the following properties: 
7r(F(X')) = G and if 9 : F^(X) — > Fp(X') is the unique homomorphism which acts as 
identity on X' and sends X\X' to 1, then 9{R') generates Ker7rn-F(X') as a closed 
normal subgroup of F(X'), and thus {X' ,0(R')) is a presentation of G. 

Proof. Part (i) easily follows from the fact that d{G) = d(G/[G, G]G P ). Part (ii) follows from 
the proof of |Wi21 Prop. 12.1.5]; the first assertion of (ii) is also proved in [Lull Lemma 1.1]. 

□ 

Proof of Theorem \3.4\ In view of Lemma l3.5f i). (a) can be proved by the same argument as 
Corollary 12.31 For part (b) let G = be the pro-p completion of G. Then d(G) = d p (T), 
and the result follows from (a) and Lemma l3.5f ii). □ 

As in the case of graded algebras, given d E N, it is natural to ask what is the minimal 
number f(d) for which there exists a finite p-group G with d(G) = d and r(G) = f(d). The 
best currently known bound is due to Wisliceny [Wislj who proved that f(d) + f [ (note 
that this coincides with the corresponding bound for graded algebras from [Wis2] obtained 
several years later). 

3.3. Word growth in Golod-Shafarevich groups. In view of our informal statement 
"Golod-Shafarevich groups are big" , it is natural to expect that GS abstract groups at least 
have exponential growth. In fact, more is true: Golod-Shafarevich groups always have uni- 
formly exponential growth, and this fact has a surpsingly simple proof. 

Proposition 3.6 ( |BaGrj ). Golod-Shafarevich abstract groups have uniformly exponential 
growth. 

Proof. Let T be a GS group with respect to a prime p and M the augmentation ideal of F p [r]. 
If G = F p is the pro-p completion of T and M is the augmentation ideal of F P [[G]] , it is easy to 
show for that for every n G N the natural map F p [r]/M n — > ¥ P [[G]]/ M n is an isomorphism. 
Thus, by Proposition I2.7f a). the sequence a n = dimp p F p [r]/M n+1 grows exponentially in n. 
Now let X be any generating set of T. Then F p [r]/M n+1 is spanned by products of the form 
(1 -xi)(l -x 2 )...(l — x m ) with < m < n and X{ G X. Each such product lies in the 
Fp-span of L>x{n), the ball of radius n with respect to X in T. Hence \L>x(n)\ > a n . □ 
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It is now known that Golod-Shafarevich groups are uniformly non-amenable [EJ2} Ap- 
pendix 2] (which strengthens the assertion of Proposition 13. 6p . but the proof of this result 
is much more involved. We will discuss the proof of non-amenability of Golod-Shafarevich 
groups in § PT2l3. 

3.4. Golod-Shafarevich groups in characteristic zero. In this subsection we will briefly 
discuss groups which are defined in the same way as Golod-Shafarevich groups in § 13. 1|. except 
that F p will be replaced by a field of characteristic zero. Unlike the positive characteristic 
case, we will begin the discussion with abstract groups. 

Let K be a field of characteristic zero, X = {x%, . . . , x^} and U = {u%, . . . , Ud} finite sets 
of the same cardinality. The map Xi — > 1 + Ui still extends to an embedding of the free group 
F(X) -> K{{U)) X . As in § EU we define the degree function D : F(X) ->NU {oo} by 

A)(/) = deg(/-l) for aRf€F(X). 

Again, the function Dq admits two alternative descriptions, one in terms of the augmentation 
ideal and another one in terms of the lower central series, which replaces the Zassenhaus 
filtration. 

Proposition 3.7. Let F = F(X). Given f G and n G N, the following are equivalent: 

(i) D (f)=n; 

(ii) / — 1 G M n \ M n+1 where M is the augmentation ideal of K[F]; 

(iii) / G 7n F \ ~/ n+1 F. 

Definition. Let T be a finitely generated abstract group. We will say that T is Golod- 
Shafarevich in characteristic zero if there is a presentation T = (X\R) s.t. 1 — \X\t + 
£refl rA)(r) < for some r G (0, 1). 

Observation 3.8. If an abstract group T is GS in characteristic zero, then V is GS with 
respect to p for any prime p. 

Proof. Let F be a finitely generated free group and D the degree function on F coming from 
the Magnus embedding in characteristic p (as defined in § 13. 1|) . Then Do(f) < D(f) for any 
/ G F by Propositions 13.2 1 and 13.71 This immediately implies the result. □ 

In view of Observation 13. 8( any result about GS (abstract) groups with respect to a prime 
p automatically applies to GS groups in characteristic zero. Somewhat surprisingly, nothing 
much beyond that seems to be known about GS groups in characteristic zero. It will be very 
interesting to prove some results about GS groups in characteristic zero (which do not apply 
to or not known for GS groups with respect to a prime p) since the former class includes 
some important groups, e.g. free-by-cyclic groups with first Betti number at least two. 

The counterparts of GS pro-p groups in characteristic zero are Golod-Shafarevich prounipo- 
tent groups. Let K be a field of characteristic zero. A prounipotent group over K can be 
defined as an inverse limit of unipotent groups over K. Given a natural number d, the free 
prounipotent group of rank d over K, denoted here by Fj>(d), can be defined as the closure (in 
the degree topology) of the subgroup of K((ui, . . . , Ud)) x , generated by the elements (1 + u«) A 
for all A G K (where by definition (1 + Uj) A = J2m=o im) u T> wn i cn makes sense since K 
has characteristic zero). Thus, the function Dq originally defined on free abstract groups can 
be naturally extended to free prounipotent groups, and one can define Golod-Shafarevich 
prounipotent groups in the same way as Golod-Shafarevich groups in characteristic zero, re- 
placing abstract presentations by presentations in the category or prounipotent groups over 
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K. If r is a GS abstract group in characteristic zero, its prounipotent completion is a GS 
prounipotent group, but it is not clear whether the converse is true. 

The general theory of prounipotent groups as well as the theory of Golod-Shafarevich 
prounipotent groups was developed by Lubotzky and Magid in [LuMal^ ILuMa2l ILuMa3j . 
Another notable result about Golod-Shafarevich prounipotent groups is due to Kassabov [Ka] 
who proved that they always contain non-abelian free prounipotent subgroups - this is a 
characteristic zero analogue of Zelmanov's theorem [Zel] discussed in § [7J 

4. Generalized Golod-Shafarevich groups 

In this section we introduce a more general form of the Golod-Shafarevich inequality and 
define the notion of generalized Golod-Shafarevich groups. Similarly one can define general- 
ized Golod-Shafarevich algebras (graded or complete filtered) - see the end of § 14.21 

4.1. Golod-Shafarevich inequality with weights. In this subsection we essentially re- 
peat the setup of § 12.31 with two differences: 

(i) generators will be counted with (possibly) different weights; 

(ii) the number of generators will be allowed to be countable. 

By allowing countably many generators we will avoid some unnecessary restrictions in 
many structural results about generalized Golod-Shafarevich groups. However, all the key 
applications could still be achieved if we considered only the finitely generated case, so the 
reader may safely ignore the few minor subtleties arising from dealing with the countably 
generated case. 

Let K be a field, U = {ui,U2, • • • , } a finite or a countable set and A = ¥ P ((U)). Define a 
function d : A — > M>o U {oo} as follows: 

(i) Choose an arbitrary function d : U — >• M>o, and if U is countable, assume that 
d(u{) — > oo as i — > oo. 

(ii) Extend d to the set of monic [/-monomials by ^(u^ . . . Ui k ) = ^(u^) + . . . + d{ui k ). 
By convention we set d{l) = 0. 

(iii) Given an arbitrary nonzero power series / = ^ c a m a 6 A (where {m a } are pairwise 
distinct monic monomials in U and c a 6 K), we put 

(4.1) d(f) = mm{d(m a ) : c a / 0}. 

Finally, we set <i(0) = oo. 

Definition. 

(i) Any function d obtained in this way will be called a degree function on ¥ p {{U)) with 
respect to U. 

(ii) If U is finite, the unique degree function d such that d{u) = 1 for all u £ U will be 
called standard. 

Given a subset S <^ A such that for each a£l, the set {s £ S : d(s) = a} is finite, we put 

H s<d (t) = Y,t d{s) . 

Note that we do not require that d is integer-valued, so Hs t d(t) is not a power series in general. 
It is easy to see, however, that for any degree function d the set Im (d) of possible values of 
d is discrete. Therefore, we can think of Hs t d(t) as an element of the ring K{{t}} whose 
elements are formal linear combinations Y2a>o c at a where c Q £ K and the set {a : c a ^ 0} is 
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discrete. The latter condition ensures that the elements of -fT{{i}} can be multiplied in the 
same way as usual power series. 
For each a > 0, let 

K((U)) a = {/ G K((U)) : d(f) > a} and K((U)) >a = {/ G K((U)) : d(f) > a}. 

As in § E3 let I be a closed ideal of K((U)) >0 , let A = K((U))/I and R C I a subset 
which generates / as a (closed) ideal and such that r a = \{r G R : d(r) = a}\ is finite for all 
a. Let 7r : K((U)) — > A be the natural projection. For each a > let A a = ir(K{{U)) a ) and 
^4>q = 7r (K((U))>a) an d a a = dim^ ^4 Q / J 4 >Q ,. Finally, define the Hilbert series by 

Hilb A ,d(t) = ^a a t a . 

In order to get a direct generalization of Theorem 14.11 we have to assume that the degree 
function d is integer- valued. 

Theorem 4.1 (Golod-Shafarevich inequality: weighted case). Assume that d is an integer- 
valued degree function. Then in the above setting we have 

(4 2) (1 - H U4 {t) + HrM) ■ Hilb A ,d(t) > 1 



1-t ~ 1-t 

We are not aware of any reference where this theorem is proved as stated above, but 
the proof of Theorem 12.61 extends to the weighted case almost without changes. Further, 
inequality (14. 2D is proved in [Ko2| in a more restrictive setting (see formula (2.11) on p. 105), 
but again the same argument can be used to establish Theorem 14.11 

4.2. Generalized Golod-Shafarevich groups. Let X = {x\, . . . , x m } and U = {u\, . . . , u m } 
be finite sets of the same cardinality, F = F^{X) the free pro-p group on X, and embed F 
into ¥ p ((U)) via the Magnus embedding Xj H > 1 + u\. 

Definition. 

(a) A function D is a called a degree function on F with respect to X if there exists 
a degree function d on F P [[F]] with respect to U = {x — 1 : x £ X} such that 
D(f) = d{f — 1) for all / G F. We will say that D is the standard degree function if 
d is standard (equivalently if D(x) = 1 for all x G X). 

(b) Given a subset S of F we put H StD (t) = E se s* D(s) - 

Now let G be a pro-p group, (X, R) a presentation of G, D a degree function on F = Fp(X) 
with respect to X and d the corresponding degree function on F p [[i 7 ]]. If D is integer- valued, 
Golod-Shafarevich inequality (|4.2h yields the following 

(4 3) (1 - H x ,p(t) + H R ,p(t)) ■ Hilb ¥p[[G]]4 (t) > 1 



1-t ~ 1-t 

Definition. 

(a) A pro-p group G is called a generalized Golod-Shafarevich ( GGS) group if there exists 
a presentation (X, R) of G, a real number r G (0, 1) and a degree function D on 
Fp(X) with respect to X such that 1 — Hx,d(t) + Hr^(t) < 0. 

(b) An abstract group G is called a GGS group (with respect to p) if its pro-p completion 
is a GGS group. 
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Remark: It is clear that a prop group is GS if and only if it satisfies condition (a) for the 
standard degree function D. 

The reader may find it strange that we did not require D to be integer-valued in the 
definition of GGS groups, since this assumption is necessary for (I4.3P to hold. The reason is 
that this would not make any difference: 

Lemma 4.2. ( [E J2t Lemma 2.4]) If (X,R) is a presentation and r £ (0,1) is such that 
1 — Hx,d(t) + Hr,d(t) < for some degree function D on F = Fp(X) with respect to X , 
then there exists an integer-valued degree function D\ on F with respect to X and T\ G (0, 1) 

such that 1 — -Hx.DifTi) + -£Tr,Di (n) < 0- Moreover, we can assume that r x Dl(/) < r D (/) for 
allfeF. 

The proof of this lemma is not difficult, but the result becomes almost obvious when 
restated in the language of weight functions, discussed in the next subsection. 

Thanks to this lemma, we can state the following consequence of (I4.3P without assuming 
that D is integer- valued: 

Corollary 4.3. In the notations of (|4.3p assume that 1 — Hx,d( t ) + Hx,r{ t ) < for some 
t E (0, 1). Then the series Hilb^ ^q\]^{t) is divergent (so in particular, G is infinite). 

All properties of Golod-Shafarevich groups established so far trivially extend to generalized 
Golod-Shafarevich groups. The main reason we are concerned with GGS groups in this paper 
is the following result, which does not have a counterpart for GS groups: 

Theorem 4.4. Open subgroups of GGS pro-p groups are GGS. 

This theorem, whose proof will be sketched in § [5l plays a crucial role in the proofs of some 
structural results about GS groups, so consideration of GGS groups is necessary even if one 
is only interested in GS groups. To give the reader a better feel about GGS groups, we shall 
provide a simple example of a pro-p group, which is GGS, but not GS. 

Proposition 4.5. Let p > 3, let F = Fp{2) be the free pro-p group of rank 2, let k > 2, and 

let Ijp denote the k th direct power of Z p (the additive group of p-adic integers). Then the 
group G = F xZj is GGS, but not GS. 

Proof. The group G has a natural presentation 

(zi,z 2 , ... ,yk | [zi,Vj] = 1, [ViiUj] = !)• (* * *) 

Thus, we have k + 2 generators and ( fe ^ 2 ) — 1 = k(k + 3)/2 relators of degree 2, and an easy 
computation shows that this presentation does not satisfy the GS condition for k > 2. To 
prove that no other presentation of G satisfies the GS condition, we can argue as follows. 
First, by Lemma [3.51 it is sufficient to consider presentations (X,R) with \X\ = d(G) = k + 2. 
We claim that in any such presentation R has at least k(k + 3)/2 relators of degree 2 (this 
will finish the proof). It is easy to see that the number of relators of degree 2 in R is at least 
log p \D 2 Fp(X))/D 3 Fp(X)\ - log p \D 2 G/D 3 G\ (recall that {D n H} is the Zassenhaus filtration 
of a group H). If x\, . . . ,Xk+ 2 are the elements of X, it is easy to see that a basis for 
D 2 Fp~(X) I D 3 Fp(X) is given by the images of commutators [xi, Xj] for i < j (here we use that 
p > 3), while D 2 G / D 3 G is cyclic of order p spanned by the image of \z\,z 2 \. Thus, the same 
computation as above finishes the proof. 

To prove that G is a GGS group, let (X, R) be the presentation of G given by (***), and 
consider the degree function D on F^(X) with respect to X given by D(z\) = D(z 2 ) = 1 
and D(yi) = N for 1 < i < k, where N is a large integer. Then D([zi,yj]) = N + 1 and 
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D([ yi , yj }) = 2N, so 1 - H x>D (r) + H R , D (r) = 1 - 2r - kr N + 2kr N+1 + ©r 2JV . This 
expression can be clearly made negative by first choosing r G (1/2, 1) and then taking a large 
enough N. Thus, G is indeed a GGS group. □ 

Remark: Essentially the same argument shows that the direct product of any GGS pro-p 
group G with any finitely generated pro-p group H will be GGS. 

In complete similarity to the group case, we will call a graded or a complete filtered K- 
algebra A generalized Golod-Shafarevich if there exists a presentation (in the corresponding 
category) (U, R) of A, a real number r G (0, 1) and a degree function d on K((U)) such that 1— 
Hu,d( T ) + HR,d( T ) < 0- However, it is not clear whether key properties of generalized Golod- 
Shafarevich groups (e.g. Theorem 14. 4p would remain true for algebras. In fact, the arguments 
of Voden |Vo| strongly suggest that even if A is a non-abelian free graded algebra, finite 
codimension graded subalgebras of A may not be generalized Golod-Shafarevich algebras. 
At the same time Voden |Voj proves that if a graded algebra A = ©^ A n has a minimal 
presentation (U,R), with \R\ < — l) 2 and \U\ > 1, then the Veronese power 

A (k) 

is 

Golod-Shafarevich for infinitely many values of k, where by definition A^ = (B^L^Akn, with 
k G N. At the moment it is not clear what should be the "right" substitute for the notion of 
generalized Golod-Shafarevich algebra (if any) which would lead to an interesting theory. 

4.3. Weight functions. In this subsection we introduce multiplicative counterparts of de- 
gree functions, called weight functions. Even though weight functions are obtained from de- 
gree functions merely by exponentiation, they provide a very convenient language for working 
with generalized Golod-Shafarevich groups. 

Definition. Let F be a free pro-p group, X a free generating set of F and U = {x— 1 : x G X} 
so that ¥ P [[F]] * ¥ P ((U)). 

(i) A function w : ¥ p [[F]] — > [0, 1) is called a weight function on ¥ P [[F]] with respect to 
U if there exists r G (0, 1) and a degree function d on F p [[i ? ]] with respect to U such 
that 

W (f)= T d (f). 

(ii) A function W : F — > [0, 1) is called a weight function on F with respect to X if there 
exists t G (0, 1) and a degree function D on F with respect to X such that 

W(f) = t d ^. 

Equivalently, W is a weight function on F with respect to X if there is a weight 
function w on Fpffi 7 ]] with respect to U such that W(f) = w(f — 1) for all / G F. 

If S is a subset of F and W is a weight function on F, we define W(S) = X^eS^^ 5 )' 
Thus, in our previous notations, if W = t d for a degree function D, then W(S) = Hs : d(t)- 
The definition of generalized Golod-Shafarevich groups can now be expressed as follows: 

Definition. A pro-p group G is a generalized Golod-Shafarevich group if there exists a 
presentation (X, R) of G and a weight function W on F^(X) with respect to X such that 

1 - W(X) + W(R) < 0. 

A weight function W will be called uniform if W = r for some r and the standard degree 
function D (that is, D{x) = 1 for all x G X). 
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5. Properties and applications of weight functions and valuations 

5.1. Dependence on the generating set. So far we defined weight functions with respect 
to a fixed generating set X. For many purposes it is convenient to have a "coordinate-free" 
characterization of weight functions, where the set X need not be specified in advance. 

Definition. Let F be a free pro-p group. A function W : F — > [0, 1) is called a weight 
function on F if W is a weight function on F with respect to X for some free generating set 
X of F. Any set X with this property will be called W-free. 

The first basic question is which sets are W-iree for a given weight function W on F? If 
W is a uniform weight function (that is, W = t d for the standard degree function D), then 
any free generating set X will be W-iree, since the standard degree function can be defined 
without reference to a specific free generating set. However, if W is not uniform, there always 
exists a free generating set X which is not W-free. This follows from Theorem 15.31 below and 
is illustrated by the following example. 

Lemma 5.1. Let F be a finitely generated free pro-p group, X = {xi, . . . , x m } a free gener- 
ating set of F and W a weight function on F with respect to X . Let f = x™ 1 . . . x™ fe where the 

indices i±, . . . , are distinct and each rii is not divisible by p. Then W(f) = max{W(xi j )}j = \- 

Proof. Let m = X{ — 1 G F p [[F]]. Then by definition W(f) = w(f — 1), where w is the unique 
weight function on F p [[F]] with respect to U = {u\, . . . ,u m } such that w{u{) = W(xj). Note 
that / — 1 = ^2 n j u ij + h where h is a sum of monomials, each of which involves at least two 
Ujj's. Since each nj / in ¥ p by assumption, we get W(f) = w(f — 1) = max{w;(ttj j )}j =1 = 
max{W(xi 3 )}^ =1 . □ 

Example 5.2. Let X = {xi,X2}, F = Fp(X) and a,/3 6 (0,1). Let W be the unique 
weight function on F with respect to X such that W{x\) = a and W{x2) = f3. Let X' = 
{xi, X1X2}. We claim that if a > (3, then W is NOT a weight function with respect to 
X' . Indeed, by Lemma \ 5.1l W(xiX2) = max{a,/3} = a. If W was also a weight function 
with respect to X' , Lemma HOI would have implied that W{x2) = W{x^[ l ■ x\Xz) is equal to 
m&x{W(xi),W(xiX2)} = a, which is false. 

If we assume that a < f3 in the above example, then W will be a weight function with 
respect to {x%, X1X2}, although this is harder to show by a direct computation. In general, 
we have the following criterion for VF-freeness. 

Theorem 5.3. Let W be a weight function on a free pro-p group F, let X be a free generating 
set of F, and assume that W(X) < 00. The following are equivalent. 

(i) X is W-free. 

(ii) If X' is any free generating set of F, then W(X) < W(X'). 

(iii) If X' is any free generating set of F, then there is a bisection a : X — > X' such that 
W(x) < W{a{x)) for all x G X. 

Proof. This is an easy consequence of results in [EJ31 § 3]. More specifically, let us say that 
X is a PF-optimal generating set if it satisfies condition (ii). The definition of a PF-optimal 
generating set in [EJ3] is different, but the two definitions are equivalent by Proposition 3.6 
and Corollary 3.7 of [EJ3J. Proposition 3.9 of [EJ3J then shows the equivalence of (i) and (ii) 
and Proposition 3.6 easily implies the equivalence of (ii) and (iii). □ 

One of the most important properties of weight functions is the following theorem: 
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Theorem 5.4. [EJ2, EJ3] Let F be a free pro-p group and W a weight function on F. If K 
is a closed subgroup of F , then W restricted to K is also a weight function. 

This theorem appears as |EJ2t Cor. 3.6] in the case when K is open in F and F is finitely 
generated and as [EJ3, Cor. 3.4] in the general case. The second proof is more conceptual 
(see a brief sketch in §[5j2), while the first one has the advantage of producing an algorithm 
for finding a W-free generating set for an open subgroup K (see a sketch in §[5j3). 

5.2. Valuations. One inconvenience in working with weight functions is that they are defined 
on free pro-p groups and not on the groups given by generators and relators we are trying to 
investigate. However, as we will explain below, given a pro-p group G and a free presentation 
7r : F — > G (that is, an epimorphism from a free pro-p group F to G), every weight function 
on F will induce a function on G satisfying certain properties. Such functions will be called 
valuations. 

Definition. Let G be a pro-p group. A continuous function W : G — > [0, 1) is called a 
valuation if 

(i) W(g) = if and only if g = 1; 

(ii) W(fg) < m<xx{W(f), W(g)} for any f,g G G; 

(iii) W([f,g}) < W(f)W(g) for any f,g G G; 

(iv) W{g p ) < W{gf for any g G G. 

It is easy to check that any weight function on a free pro-p group is a valuation, but the 
converse is not true - for instance, if W is a weight function on a free pro-p group F such 
that W(f) < 1/2 for all / G F, the function W'(f) = 2W(f) will be a valuation, but not a 
weight function. It is also not hard to show that given a free generating set X of F and a 
weight function W on F with respect to X the following is true: 

If W is any valuation on F such that W'{x) = W{x) for all x e X, then W'{f) < W{f) 
for all / G F. 

There are two simple ways to get new valuations from old. 

(i) If W is a valuation on G and if is a closed subgroup of G, then W restricted to H is 
clearly a valuation on H (we will denote the restricted valuation also by W). 

(ii) If 7r : H — > G is an epimorphism of pro-p groups, then every valuation W on H 
induces the corresponding valuation W on G given by 

W'(g) = w£{W(h) : h G H,ir(h) = g). 

In the special case when G is defined as a quotient of H and it : G — > H is the natural 
projection, we will usually denote the induced valuation on G also by W. 

An important special case of (ii) is that if G is a pro-p group and tt : F — > G is a free 
presentation of G, then every weight function on F will induce a valuation on G. By [EJ31 
Prop. 4.7], the converse is also true: every valuation on G is induced from some weight 
function in such a way; however, if G is finitely generated, one cannot guarantee that F can 
be chosen finitely generated. 

Given a valuation W on G and a G (0, 1), define the subgroups G a y/ and G <a ^w of G by 

G a ,w = {g€G: W(g) < a} and G <a , w = {g G G : W{g) < a}. 

The associated graded restricted Lie algebra L\v(G) is defined as follows: as a graded abelian 
group Lw{G) = ®aelm(W)Ga,W /G<a,w > the Lie bracket is defined by 

[gG <a ,w,hG <f s !W ] = [g,h]G <a p jW for all g G G a yv and h G Gp tW 
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(where [g,h] = g l h l gh) and the p-power operation is defined by 

(gG<a,w)^ = 9 p G < aP,w for all g G G a y/ and h G G^vf- 

If if is a closed subgroup of G, it is easy to see that Lw{H) (the Lie algebra of H with 
respect to the induced valuation) is naturally isomorphic to a subalgebra of L^y(G). The 
following characterization of weight functions among all valuations is obtained in [EJ3 . 

Theorem 5.5. ([EJ3, Corollary 3.3]) Let Fbea free pro-p group. A valuation W on F is a 
weight function if and only if Lw(F) is a free restricted Lie algebra. 

Since subalgebras of free restricted Lie algebras are free restricted, Theorem 15.41 follows 
from Theorem 15.51 and a paragraph preceding it. 

Finally, similarly to weight functions, given a valuation W on a pro-p group G and a subset 
S of G, we put W(S) = £ sgS W(s). 

5.3. Weighted rank, index and deficiency. Given a pro-p group G and a valuation W 
of G, there are three important numerical invariants - 

(i) the IV-rank of G, denoted by rk\y{G), 

(ii) the TV-deficiency of G, denoted by defw(G), and 

(iii) for every closed subgroup H of G, the TV-index of H in G, denoted by [G : H]w 

- which behave very similarly to their usual (non-weighted) counterparts. 
The definition of the TV-rank is the obvious one: 

Definition. The TV-rank of a pro-p group G, denoted by rkw(G), is the infimum of the 
set {IV(X)} where X ranges over all generating sets of G. In fact, a standard compactness 
argument shows that if this infimum is finite, it must be attained on some set X. 

Before defining TV-deficiency, we need some additional terminology. 

Definition. A valuation IV on a pro-p group G is called finite if there is a free presentation 
7r : F — > G and a weight function TV on F which induces IV and such that rkyy(F) < oo. 

Note that in many cases finiteness of a valuation IV holds automatically: this is the case if 
W is a weight function on a finitely generated free pro-p group F or if IV is quotient-induced 
from such a weight function. In applications of Golod-Shafarevich groups all valuations will 
be obtained in such way, so the problem of verifying finiteness of a valuation never arises in 
practice. In more theoretical contexts, one can use the following criterion (|EJ3I Prop. 4.7]): 
a valuation W on G is finite if and only if there is a subset Y of G, with H^(y) < oo, s.t. the 
elements {yG^^y/ : y G Y} generate the Lie algebra L]y(G). 

Definition. 

(a) A weighted presentation is a triple (X, R, W) where (X, R) is a (pro-p) presentation 
and W is a weight function on Fp(X) with respect to X. 

(b) Let (X, R, W) be a weighted presentation, where W(X) < oo. We set defw(X, R) = 
W{X) - W(R) - 1. 

(c) Let G be a pro-p group and W a finite valuation on G. The VF-deficiency of G, 
denoted by defw(G), is defined to be the supremum of the set {defy^(X, R)} where 

(X, R, W) ranges over all weighted presentations such that G = (X\R), W induces 
W and W(X) < oo. 

Note that we can now rephrase the definition of GGS groups as follows. 
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Definition. A pro-p group G is GGS if and only if defw(G) > for some finite valuation 
W of G. 

Thus, GGS groups can be thought of as groups of positive weighted deficiency (explaining 
the title of [EJ3]). We will not use the latter terminology in this paper, but we will work 
with the quantity defw(G), which is very convenient. 

Recall two classical inequalities relating the usual (non-weighted) notions of rank, defi- 
ciency and index. 

Theorem 5.6. Let G be a finitely generated abstract or a pro-p group and H a finite index 
subgroup of G. Then 

(a) d(H) — 1 < (d(G) — 1)[G : H\. Moreover, if G is free, then H is free and equality 
holds. 

(b) def(H)-l>(def(G)-l)[G:H}. 

Part (a) is just the Schreier formula, and (b) is an easy consequence of (a) and the 
Reidemeister-Schreier rewriting process (see, e.g., [Qs2[ Lemma 2.1]). 

It turns out that one can define VF-index [G : H]w in such a way that the weighted 
analogues of (a) and (b) will hold. 

Definition. Let W be a valuation on a pro-p group G and H a closed subgroup of G. For 
each a G Im (W) let c a! w(G/ H) = log p \G a ,wH/G <ot! wH\. The quantity 

[G :H\ W = H 

a£lm (W) 

is called the W -index of H in G. 

In the next subsection we will reveal where the above formula comes from. At this 
point we just observe that the usual index [G : H] is given by the formula [G : H] = 
pZ)aeim(w) c a,w(G/H) ^ jj encei [f we g x jj anc [ CO nsider a sequence {W n } of valuations on G 
which converges pointwise to the constant function 1 on G\{1}, then the sequence \G : H]\y n 
will converge to [G : H]. 

Here is the weighted counterpart of Theorem 15.61 We will discuss the idea of its proof in 
the next subsection. 

Theorem 5.7. Let W be a valuation on a pro-p group G and let H be a closed subgroup of 
G with [G : H]\y < oo. Then 

(a) rkw(H) — 1 < (rkw(G) — l)[G : H\w- Moreover, if G is free and W is a weight 
function, then equality hodls. 

(b) def w {H)>def w {G)[G:H] w . 

Note that part (b) immediately implies that an open subgroup of a GGS pro-p group is 
also a GGS pro-p group, the result stated earlier as Theorem 14.41 
Below are some properties of VF-index which we shall need later: 

Proposition 5.8. Let W be a valuation on a pro-p group G. Then VF-index is multiplicative, 
that is, if K C H are closed subgroups of G, then [G : K]yy = [G : H]w ■ [H : K]yy. 

Proposition 5.9 (Continuity lemma). Let W be a valuation on a pro-p group G, let H be 
a closed subgroup of G, and let {U n } be a descending chain of open subgroups of G such that 
H = C\U n . Then 

(i) [G : H] w = lim ri _ > . 0O [G : U n ] W - 
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(ii) // [G : H]w < oo, then rkw(H) = lim„_ ) . 0O rkw(U n ). 

(iii) If [G : H]w < oo , then defw{H) = lim„->oo defw(U n ) 

Proposition 15.81 is straightforward, and Continuity Lemma is established in [E J3] : (i) is 
|EJ3l Lemma 3.15], (ii) is [EJ3, Lemma 3.17] and (iii) follows from the proof of \EJ3\ Prop. 4.3] 
although it is not explicitly stated there. 

5.4. Proof of Theorem 15.71 (sketch). Proposition 15.91 reduces both (a) and (b) to the 

case when H is an open subgroup. By Proposition 15.81 it suffices to consider the case when 
[G:H]=p. 

(a) We first treat the case when G is free and TV is a weight function. Let X be any 
VF-free generating set of F. First, replacing X by another T^-free generating set, we can 
always assume that 

There is just one element x £ X which lies outside of H. (* * *) 

To achieve this, we let x be the element of the original set X which lies outside of H and has 
the smallest W-weight among all such elements and then, for each z £ X \ {x} \ H, replace 
z by zx m for suitable m G Z so that zx m € H (such m exists since G/H is cyclic of prime 
order and x H ) . Condition (ii) in the definition of a valuation and Theorem 15.31 ensure 
that the new generating set of F is still W-free. 

From now on we shall assume that (***) holds. A standard application of the Schreier 
method shows that H is freely generated by the set X' = {y x * : y £ X\{x}, < i < p}U{x p }. 
This set, however, is not W-hee by Theorem 15.31 since it clearly does not have the smallest 
possible VF-weight - for instance, one can replace y x by y~ l y x = [y,x], thereby decreasing 
the total weight as W([y, x\) < W(y)W(x) < W(y) = W(y x ). It is not difficult to show that 
the VF-weight will be minimized on the generating set 

(5.1) X = U veX \{x}{y, [y,x], [y,x,x],.. . , [y, x,..., x] } U {x p }. 

p-l times 

Informally, this happens because if we let U = {x — 1 : x G X} (so that F P [[F]] = ¥ p ((U))) 
and expand elements of the set U = {x — 1 : x G X} as power series in U, then the monomials 
of maximal VF-weight in those expansions will all be distinct (this is a Grobner basis type of 
argument). Thus, by Theorem 15.31 X is VF-free. 

Note that if r = W(x), then in the above formula we have 

W(X) - 1 = (W(X) - t)(1 + r + . . . + rP" 1 ) + t p - 1 = (W(X) - 1) • IZJL, 

1 — T 

Again by Theorem 15.31 we have W(X) = rkw(F) and W(X) = rkw{H)- Moreover, it is 
not difficult to show that c a ^w(G/H) is equal to 1 for a = r and for a ^ r. Hence 
[G : H]w = , and we are done. 

In the case when G is an arbitrary group, we can essentially repeat the above argument, 
assuming at the beginning that X is a generating set of G with W(X) = rk\y(G). The set X 
given by (|5.ip will still generate H, but we no longer know whether W{X) equals rkyy(H); 
this is why we can only claim inequality in the formula. 

(b) follows easily from (a) and the Schreier method. First, by Propositions 15.81 and 15.91 
we can again assume that [G : H] = p. Consider any weighted presentation (X, R, W) 
of G where W induces W and W(X) < oo. As before, we can assume that (***) from 
(a) holds. Then H is given by the presentation (X,R') where X is as before and R' = 
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{r xl : r G R, < i < p}. Again, we can replace the set of relators R' by the set R = 
U re /j{r, [r, x], [r, x, x], . . . , [r, x, . . . , x] }, and by direct computation W(R) < W{R)[G : H]w- 

p—\ times 

Since W(X) - 1 = (W(X) - 1)[G : f% by the proof of (a), we conclude that def w {X, R) > 
def^y(X, R)[G : H]w, which yields (b) by taking the supremum of both sides over all triples 

(X,R,W). 

5.5. VF-index and Quillen's theorem. Although we already gave some indication why 
the notion of VK-index is a useful tool, its definition may still appear mysterious. Below we 
state another formula generalizing (a version of) Quillen's theorem, where W-index naturally 
appears. In order to state it, we have to go back to degree functions and also introduce some 
additional notations. 

Let G be a pro-p group, ir : F — > G a free presentation of G, d a degree function on F p [[F]] 
and D the corresponding degree function on F, that is, D(a) = d(a — 1). Then D induces a 
function on G (by abuse of notation also denoted by D) given by 

£>(<?) =inf {£>(/) : f € F,*(f) = 9}- 
For each A G M> let G A > D = {g G G : D(g) > A}, G >X > D = {g G G : D(g) > A} and 
c^ D {G) = [G A - D : G >X ' D ] 

Theorem 5.10. Let G,F,Tr,d,D be as above. 

(a) (Quillen's theorem) The following equality of generalized power series holds: 

„X,Dtn\ 

(5-2) Hilb ¥p[[G]ld (t) = 11 

Aelm (D) 

(b) Let t G (0, 1), and define the function W : G ->• [0, 1) by W(g) = r D (f). Then W is a 
valuation on G and ifi/6F p [[G]],rf( T ) = [G : the W -index of the trivial subgroup. 

Sketch of proof. The idea of the proof of (a) is very simple. Consider the graded restricted Lie 
algebra L D (G) = (B\^i m ^D)G X ' D /G >X,D associated to the degree function D and the graded 
associative algebra prrfF p [[G]] = ©Aeim(<i)^ , p[[G ! ]] A '' i /Fp[[G]] >A ' c( associated to the degree func- 
tion d (in fact, L D (G) coincides with the Lie algebra Lw(G) defined in §02, corresponding 
to the valuation W = t d for any r G (0, 1)). It turns out that the restricted universal en- 
veloping algebra U(L D (G)) is isomorphic to grdF p [[G]] as a graded associative algebra. Hence 
Hilb ¥p [[G]] ,d(t), which by definition is the Hilbert series of pr^FpffG]], must equal the Hilbert 
series of U(L D (G)), which is equal to the right-hand side of (|5.2[) by the Poincare-Birkhoff- 
Witt theorem for restricted Lie algebras. 

In the case when D is the standard degree function, the isomorphism gr^FpffG]] = U(L D (G)) 
is known as Quillen's theorem and its detailed proof can be found, for instance, in [DDMS , 
Ch.11,12]. In |EJ2t Prop. 2.3], (a) is proved for the case of integer-valued degree functions 
D, but the same argument works for arbitrary D. 

(b) It is clear that TV is a valuation on G. Note that G X,D = G T \ w and G >X,D = G <T x w , 
so c x,D (G) = c T x W (G/{1}). Therefore, if we set t = r in (|5.2|) . the right-hand side becomes 
equal to [G : {l}]w/ by definition. □ 

Corollary 5.11. Let G be a GGS pro-p group, and let W be a valuation on G such that 
defw(G) > 0. Then [G : {l}]w = °°> an d therefore by Proposition 15.91 (Continuity lemma), 
the set {[G : where U runs over open subgroups of G, is unbonded from above. 
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Proof. Let (X,R,W) be a weighted presentation of G such that def^y(X, R) > and W 

induces W . By definition W = t d for some r £ (0, 1) and degree function D on F = 
Fp(X) with respect to X. Let d be the degree function on F p [[F]] corresponding to D. By 
Corollary 14.31 the series Hilb^ [[G]],d( T ) diverges, so the result follows from Theorem 15. 10f b) . 

□ 

Using Quillen's theorem, we can also interpret inequality in Theorem 15.7( b) (or rather a 
slightly stronger version of it) as yet another generalization of GS inequality. 

First we restate Theorem l5.7( b) in terms of weight functions (formally the statement below 
is stronger, but it follows immediately from the proof): 

Theorem 5.12. Let G be a pro-p group given by a presentation (X,R) and let W be a 
weight function on Fp{X) with respect to X. Let K be an open subgroup of G. Then K has 
a presentation (X',R'), with X' C Fp{X), such that 

(1 - W(X') + W{R')) < (1 - W(X) + W(R)) ■ [G : K] w . 

Suppose now that W = t d for a degree function D and r 6 (0, 1). As in the proof of 
Theorem \^Mjo) we have c X ' D (G/K) = c T x W {G/K), so Theorem EH can now be restated as 
a numerical inequality 



" X,D (G/ K) 



(5.3) l-H x ,, D (T) + H R , iD (T)<(l-H XtD (T) + H R , D (T))- H (y^P) 

One can show (see [EJ2, Theorem 3.11(a)]) that if D is integer-valued (this time it is an 
essential assumption), then by dividing both sides of (|5.3p by 1 — r and replacing a real 
number r by the formal variable t, we get a valid inequality of power series (the proof of this 
result follows the same scheme as that of Theorem 15. 7f b) ) : 

Theorem 5.13. (|EJ2, Theorem 3.11(a)]) Let G be a pro-p group, (X,R) a presentation of 
G and D an integer-valued degree function on F = Fp(X) with respect to X . Let K be an 
open subgroup of G. Then there exists a presentation (X',R') of K, with X' C Fp(X), such 
that the following inequality of power series holds: 

f54 x 1 - Hx>,p(t) + H R , tD [t) 1 - H x ,p(t) + H R , D (t) tt / l-t n P ^ cn ' D{G/K) 

[ ' ' i-t ~ i-t W 1 i-* n 

raelm(D) v 

Finally, assume that K is normal in G. Then applying Theorem I5.10f a) to the quotient 
group G/K and letting d be the degree function corresponding to D, we can rewrite (|5.4p as 
follows: 

( 5 - 5 ) Y^t T^t ff "0F P [[G/Jf]],dW- 

This inequality can be thought of as a finitary version of the generalized Golod-Shafarevich 
inequality (|4.3|) : in fact, (|4.3p can be deduced from (|5.5p . as explained in [EJ2] (see a remark 
after Theorem 3.11). Moreover, (|5.5|) remains true even without the assumption that K is 
normal in G, but Hilb$ [[G/K]],d(P) wr ^ need to be defined differently. 

5.6. A proof without Hilbert series. In conclusion of this long section we shall give a 
short alternative proof of the fact that GGS pro-p groups are infinite, which does not use 
Hilbert series. The proof is based on the following lemma, which we shall also need later for 
other purposes. 
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Lemma 5.14. Let G be a pro-p group and (X,R,W) a weighted presentation of G, where 
W is finite. Then 

(a) d(G) > W(X) - W(R). 

(b) Let a > 0, and given a subset S of F(X), let S> a = {s 6 S : W{s) > a}. Then 
d(G)>W(X> a )-W(R> a ). 

Proof Note that (a) follows from (b) by letting a — > 0, so we shall only prove (b). Since 
G is pro-p, there is a subset Y C X such that Y generates G and |y| = d(G). Then 
the presentation (X, RUY) defines the trivial group. Hence RUY generates Fp(X) as a 
normal subgroup of itself, and since Fp(X) is pro-p, RUY generates Fp{X) as a pro-p group. 
Since X is TV-free, by Theorem Qi)(iii), we have W(X> a ) < W{{R U Y)> a ), whence 
W(X> a ) < W(R> Q ) + W(Y> a ) < W(R> a ) + \Y> a \ < W{R> a ) + d{G). □ 

Using Lemma 15.141 and Theorem 15.7( b). it is now very easy to show that GGS pro-p 
groups are infinite. Indeed, suppose that G is a GGS pro-p group, so that defw(G) > for 
some W. Then W{X) — W(R) > for some weighted presentation (X, R, W) of G, so by 
Lemma 15.14( a) , G is non-trivial and in particular has an open subgroup of index p, call it 
H. By Theorem 15 .7} H is also GGS. We can then apply the same argument to H and repeat 
this process indefinitely, thus showing that G is infinite. 

6. Quotients of generalized Golod-Shafarevich groups 

One of the reasons (generalized) Golod-Shafarevich groups are so useful is that they possess 
infinite quotients with many prescribed group-theoretic properties. Some results of this type 
are deep and require original arguments, but in many cases all one needs is the following 
obvious lemma: 

Lemma 6.1. Let G be a pro-p group and W a valuation on G. Lf S is any subset of G, then 
def w (G/(S) G ) > def w {G) - W(S). 

As the first application of this lemma, we shall prove a simple but extremely useful result 
due to J. Wilson [Wi2], 

Theorem 6.2. Every GS (resp. GGS) abstract group has a torsion quotient which is also 
GS (resp. GGS). 

Proof. The argument is just a minor variation of the second proof of Theorem 12.51 but we 
shall state it using our newly developed language. Let T be a GGS abstract group (which 
can be assumed to be residually-p) , G = Tp and W a valuation on G such that defw{G) > 0. 
For each g G V we can choose an integer k(g) € N such that if R = {g pfe(9> : g £ r}, then 
W(R) < defw{G). 

Let r" = F/(R) r . Then r' is torsion; on the other hand, the pro-p completion of T' is 
isomorphic to G' = G/(R) G which is GGS by Lemma 16.11 

If r is GS, then we can assume that the initial W is induced by a uniform weight function, 
whence T' is also GS. □ 

The argument used to prove Theorem 16.21 has the following obvious generalization: 

Observation 6.3. Let (P) be some group-theoretic property such that 

(i) (P) is inherited by quotients, 

(ii) given an abstract group V , a valuation W on its pro-p completion G = T^ and £ > 0, 
there exists a subset R £ ofTp such that W(R £ ) < e and the image of F in G/{R £ ) 
has (P). 
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Then any GGS group (resp. GS group) has a GGS quotient (resp. GS quotient) with (P). 
Moreover, this quotient can be made residually finite. 

Of course, in the proof of Theorem 16.21 (P) was the property of being a p-torsion group. 
Below we state several other results which can be proved using Observation 16 . 31 or its variation. 

Theorem 6.4. (|EJ3j Theorem 1.2]) Every GGS abstract group has a GGS quotient with 
property LERF. 

Theorem 6.5 ([Er2]). Every GGS abstract group has a residually finite quotient whose FC- 
radical (the set of elements centralizing a finite index subgroup) is not virtually abelian. (Of 
course, such a quotient must be infinite). 

Theorem 6.6 (|MyaOs|). Every recursively presented GS abstract group has a GS quotient 
Q which is algorithmically finite (this means that no algorithm can produce an infinite set of 
pairwise distinct elements inQ). 

For the motivation and proofs of these results the reader is referred to the respective papers 
(the first two theorems will be mentioned again in § fT2l) . Here we remark that verification 
of condition (ii) in the proofs of these three theorems is not as straightforward as it was 
in Theorem 16.21 In particular, the set of additional relators R e cannot be described "right 
away"; instead it is constructed via certain iterated process. 

We finish this section with two useful technical results, which are also based on Lemma [6. 11 

Lemma 6.7 (Tails Lemma). Let G be a GGS pro-p group. Let A andT be countable subgroups 
of G with A C r and A dense in V. Then G has a GGS quotient G' such that A and T have 
the same image in G' . 

Proof. Let W be a valuation on G such that defw(G) > 0. Since A is countable and dense in 
T and W is continuous, for each g € T, we can choose l g 6 A such that if R = {lg l g '■ g 6 T}, 
then W(R) < defw{G). It is clear that the group G/(R) G has the required property. □ 

Remark: The terminology 'tails lemma' is based on the following "visualization" of the 
above procedure: we represent each element g G T as l g ■ (l g ~ 1 g) where l g is a good approxi- 
mation of g by an element of A and l g ~ 1 g is a tail of g (which is analogous to a tail of a power 
series). The desired quotient G' of G is constructed by cutting all the tails. 

Lemma 6.8. Let G be a GGS pro-p group. Then some quotient Q of G has a weighted 
presentation (X,R,W) such that defw(X, R) > and X is finite (in particular, Q is a 
finitely generated GGS pro-p group). 

Proof. By definition, G has a weighted presentation (Xq, Rq,W) with def\y(Xo, Rq) > 0. 
Choose a finite subset X C Xq such that W(X \ Xq) < defw(Xo, Ro). Then it is easy to 
check that the group Q = G/(X \ Xq) g has the required property. □ 

7. Free subgroups in generalized Golod-Shafarevich pro-p groups 

As we saw in § [3l Golod-Shafarevich abstract groups may be torsion and therefore need 
not contain free subgroups. In this section we shall discuss a remarkable theorem of Zel- 
manov |Zelj which asserts that Golod-Shafarevich pro-p groups always contain non-abelian 
free pro-p subgroups. In fact, we will show that the proof of this result easily extends to 
generalized Golod-Shafarevich pro-p groups. 

Theorem 7.1 (Zelmanov). Every generalized Golod-Shafarevich pro-p group contains a non- 
abelian free pro-p subgroup. 
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It is not a big surprise that Golod-Shafarevich pro-p groups contain non-abelian free ab- 
stract groups since the latter property seems to hold for all known examples of non-solvable 
pro-p groups. However, containing a non-abelian free pro-p subgroup is a really strong prop- 
erty for a pro-p group. For instance, pro-p groups linear over Z p or F p [[f]] cannot contain 
non-abelian free pro-p subgroups [BL], and it is conjectured that the same is true for pro-p 
groups linear over any pro-p ring. 

We start with some general observations. Let G be a finitely generated pro-p group. There 
is a well-known technique for proving that G contains a non-abelian free abstract subgroup. 
Let F(2) be the free abstract group of rank 2, and suppose that F(2) does not embed into 
G. Then for any g,h G G there exists a non-identity word w G F(2) such that w(g,h) = 1. 
Thus 

G x G = U^ eF ( 2 )\{i}(G x G) w where (G x G) w = {(g, h) G G x G : w(g, h) = 1}. (* * *) 

It is easy to see that each subset (G x G) w is closed in G x G, and since F(2) is countable, 
while G x G is complete (as a metric space), Baire category theorem and (***) imply that 
for some w G F{2) \ {1}, the set (G x G) w is open in G x G, so in particular, it contains 
a coset of some open subgroup. The latter has various strong consequences (e.g. it implies 
that the Lie algebra L(G) satisfies an identity), which in many cases contradicts some known 
property of G. 

The following technical result is a (routine) generalization of [ZeU Lemma 1] from GS to 
GGS algebras and is proved similarly to Theorem 13.31 

Lemma 7.2. Let K be a countable field, and let A be a generalized Golod-Shafarevich 
complete filtered algebra, that is, A has a presentation (U\R) s.t. 1 — Hi/^(t) + Hr^(t) < 
for some r G (0,1) and some degree function d on K((U}}. Let A a b s be the (abstract) 
subalgebra of A (without 1) generated by U. Then there exist an epimorphism ir : A — > A' , 
with A' also GGS, and a function v : N — > N such that for any n G N, any n elements of 
n{A v Jj^ ') generate a nilpotent subalgebra. 

7.1. Sketch of proof of Theorem 17.11 The proof of Theorem 17.11 roughly consists of two 
parts - reducing the problem to certain question about associative algebras (Proposition on 
p. 227 in |Zel| ) and then proving the proposition. This proposition is actually the deeper part 
of Zelmanov's theorem, but since it is not directly related to GS groups or algebras and its 
proof is somewhat technical, we have chosen to skip this part in our survey and concentrate 
on the first part of the proof. This will be sufficient to make it clear that the proof applies 
to GGS groups and not just GS groups as stated in [Zelj . 

Let G be a GGS pro-p group and assume that it does not contain a free pro-p group of 
rank 2, denoted by F%. As above, denote by F{2) the free abstract group of rank 2. Let D 
denote the standard degree function defined in § 13.11 (not the degree function which makes 
G a GGS group!) In this proof we shall use the definition of D in terms of the Zassenhaus 
filtration (see Proposition I3.2H . 

Step 1: Since F2 is uncountable, the above approach for proving the existence of a non- 
abelian free abstract subgroup cannot be applied directly. However we can still say that for 
any g,h G G there exists a non-identity element w G F2 (this time w may be an infinite word) 
such that w(g, h) = 1. Equivalently, for any g,h G G there is a non-identity word w G F(2) 
such that w(g, h) = w'(g, h) for some w' G F2 with D(w') > D(w). 

Step 2: The same application of the Baire category theorem as above implies that there is 
w G ^(2), elements go, h$ G G and an open subgroup K of G such that for any g,h G K we 
have w(gog, h$h) = w'(gog, hoh) for some w' G F2 (depending on g and h) with D(w') > D(w). 
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Step 3: Since Steps 1 and 2 can be applied to any open subgroup of G, we can assume 
in Step 2 that W(go) and W(ho) are as small as we want, where W is a valuation on G s.t. 
defw(G) > 0. In particular, by Lemma IfTTl we can ensure that the group G' = G/(go,ho) 
is also GGS. The image of K in G' , call it K , is also GGS by Theorem 14.41 Thus, replacing 
G by K' (and changing the notations), we can assume that 

for any g,h G G there is w' G F 2 , with D(w') > D(w), s.t. w(g, h) = w'(g, h) (***) 

Step 4-' If k = D(w), then we can multiply w by any element of Dk+iF 2 without affecting 
(***). In this way we can assume that w is a product of elements c p where c is a left-normed 
commutator of degree k/p e . Next note that if some w satisfies (***) and v G F 2 is such that 
D([w,v]) = D(v) + D(w), then (***) still holds with w replaced by [w, v]. After applying 
this operation several times, we can assume that w = c\ . . . c% where each a is a left-normed 
commutator of length D(w). 

Step 5: Now let L 2 be the free F p -Lie algebra of rank 2 and Lie(w) = Lie(c\)+. . .+Lie(c t ) G 
L 2 where Lie(ci) is the Lie commutator corresponding to Cj. It is not difficult to see that 
condition (***) can now be restated as the following equality in F P [[G]]: for any g, h G G we 
have 

Lie(w)(g - 1, h - 1) = O k+ i(g -l,h-l) 

where Lie(w)(g — 1, h — 1) is simply the element Lie{w) evaluated at the pair (g— 1, h — 1) and, 
for a finite set of elements ai, . . . , a s , Ofc+i(ai, . . . , a s ) is a (possibly infinite) but converging 
sum of products of a\, . . . , a s , with each product of length > k + 1 (recall that k = D(w)). 

Thinking of Lie(w) as an element of the free associative F p -algebra of rank 2, we can 
consider the full linearization of Lie(w), call it /. Then / is a polynomial of degree k in k 
variables, and it is easy to check that for any gi,. . . ,gk G G we have f(gi — 1, . . . , gp. — 1) = 
Ok+i{gi - 1, ... ,0k - 1). 

Step 6: Let (X, R) be a presentation for G satisfying the GGS condition. Then the 
algebra A = ¥ P [[G]] is GGS (with presentation (U,R a i g ) where U = {x — 1 : x G X} and 
Raig = {r — 1 : r G R}). Let us apply Lemma 17^21 to A, and let tt : A — > A' be as in the 
conclusion of that lemma. Note that we do not know whether the group G' = tt{G) is GGS, 
but the fact that A' is a GGS algebra will be sufficient. Let B = Tr(A a b s ) (in the notations of 
Lemma 17^21) . and let T be the abstract subgroup of G' generated by (the image of) X. Note 
that T C l + B = {l + b: be B}, and moreover D m T C 1 + B m for all m G N. Therefore, 
we have the following (with part (ii) being a consequence of Step 5). 

(i) There exists a function v : N — > N such that for any n G N, any n elements of B u ^ 
generate a nilpotent subalgebra. 

(ii) There exists a multilinear polynomial / of degree k such that for any gi,...,gj. G 
DpftyF, the element f{g\ — 1, ...,<7fc — 1) is equal to a finite sum of products of 
gi — 1, . . . , gk — 1, with each product of length at least k + 1 (the sum must be finite 

by (i)). 

As proved in \Lel\ Proposition, p. 227], if B is a finitely generated F p -algebra and T is a 
subgroup of 1 + B such that (i) and (ii) above hold, then B is nilpotent (and hence finite- 
dimensional). This yields the desired contradiction since in our setting ¥ p + B is a dense 
subalgebra of A' = it (A), which is GGS and therefore infinite-dimensional. This concludes 
our sketch of proof of Theorem 17.11 

As we already mentioned, the characteristic zero counterpart of Theorem 17.11 was estab- 
lished by Kassabov in |Ka| . who showed that every Golod-Shafarevich prounipotent group 
contains a non-abelian free prounipotent subgroup. 
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8. Subgroup growth of generalized Golod-Shafarevich groups 

In this section we shall discuss various results about subgroup growth of GGS groups. 
We shall pay particular attention to this topic in this paper not only because of its intrinsic 
importance, but also because there are two classes which contain many Golod-Shafarevich 
groups - Galois groups Gk, p ,s and fundamental groups of hyperbolic 3-manifolds - where 
subgroup growth has a direct number-theoretic (resp. topological) interpretation. 

We shall restrict our discussion to GGS pro-p groups. Since the majority of the results we 
state deal with lower bounds on subgroup growth and the subgroup growth of an abstract 
group T is bounded below by the subgroup growth of its pro-p completion, these results yield 
the corresponding lower bounds for the subgroup growth of GGS abstract groups. 

8.1. Some generalities on subgroup growth. If G is a finitely generated pro-p group, 
denote by a m (G) the number of open subgroup of G of index m (note that a m (G) = unless 
m is a power of p). The asympotic behaviour of the sequence {a pk (G)}k>i is closely related 
to that of the sequence {i"k(G)} defined below, the latter being much easier to control. 

Lemma 8.1. Let G be a finitely generated pro-p group. For each k G N let 

r k(G) = max{d(U) : U is an open subgroup of G of index p k }. 

Then 

(i) a pk {G)>p^ G ) -1; 

(ii) a pk {G) < a pk -i(G) ■ (p r ^ G ) - 1), and therefore a pk {G) < pDto^G). 

Proof, (i) Let U be an open subgroup of index p k ~ 1 with d(U) = rfc_i(G). The quotient 
U/[U, U]U P is a vector space over F p of dimension d(U) and therefore has p d ^ — 1 subspaces 
of codimension 1. These subspaces correspond to subgroups of index p in U, and each of 
those subgroups has index p k in G. 

(ii) follows from the same argument and the fact that each subgroup of index p k in a pro-p 
group is contained in a subgroup of index p k ~ 1 . □ 

By the Schreier index formula, the sequence {rk(G)} grows at most linearly in p k . Hence, 
Lemma 18.11 shows that the subgroup growth of any finitely generated pro-p group G is at 
most exponential, and the subgroup growth is exponential if and only if inf r^{G)/p k > 0. In 

k 

fact, there is an even more elegant characterization of exponential subgroup growth due to 
Lackenby |La31 Theorem 8.1]. 

Definition. 

(a) Let G be a finitely generated pro-p group and {G n } a strictly descending chain of 
open normal subgroups of G. We will say that {G n } is an LRG chain (where LRG 
stands for linear rank growth) if mi(d(G n ) — 1)/[G : G n ] > 0. 

n 

(b) Let r be a finitely generated abstract group and {T n } a strictly descending chain of 
normal subgroups of T of p-power index. We will say that {r n } is an LRG p-chain 
if inf(d p (r n ) - l)/[r : T n ] > 0. (Recall that d p {K) = d(A/[A, A]A P ) = d(A p ) for an 

abstract group A.) 

Theorem 8.2 (Lackenby). Let G be a finitely generated pro-p group. The following are 
equivalent. 

(i) G has exponential subgroup growth 

(ii) There is c > such that rj.(G) > cp k for all k. 
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(iii) There is c > such that rk(G) > cp k for infinitely many k. 

(iv) G has an LRG chain. 

Proof. The equivalence of (i) and (ii) has already been discussed. The implication "(iv)=> 
(iii)" is clear. The implication "(iii)=^ (ii)" follows from the fact that the quantity (d(U) — 
1)/[G : U] does not increase if U is replaced by its open subgroup, and hence the sequence 
rfc( y.- 1 is non-increasing. Finally, the implication "(h)=^ (iv)" is established by a Cantor 
diagonal argument (see [La3j Theorem 8.1] for details). □ 

8.2. Subgroup growth of GGS groups. As we will see later in the paper, many naturally 
occurring GS groups have LRG chains and therefore have exponential subgroup growth. It 
is very likely that there exist GS groups with subexponental subgroup growth, but to the 
best of our knowledge, this problem is still open. The best currently known lower bound 
on the subgroup growth of GS groups (which also applies to GGS groups) is due to Jaikin- 
Zapirain \EJ2\ Appendix B] and is best stated in terms of the sequence {rfc(G)}. 

Theorem 8.3 (Jaikin-Zapirain). Let G be a finitely generated generalized Golod-Shafarevich 
pro-p group. Then there exists a constant (3 = (3(G) > such that Tk[G) > p h for infinitely 
many k. 

Proof. Let (X, R) be a presentation of G and D an integer- valued degree function on F = 
F^(X) with respect to X such that Hx,d(t) — Hr,d(t) — 1 > for some r G (0,1). Let 
7r : F — > G be the natural projection, and for each n G N let 

G n = {g G G : g = tt(/) for some / G F with D(f) > n} and c n = log p [G n : G n+1 ] 

(Thus, G n = G n ' D and c n = c n > D (G) in the notations of § [53]). 

By Theorem 15. 131 there exists a presentation (X n , R n ) of G n , with X n C Fp(X), such that 

H XniD {r) - H Rn , D {r) - 1 > (H x , D (r) - H R>D (r) - 1) Ui=o (j^Y > whence 

log p (H Xn Mr) ~ H RntD (r) - 1) > log p (H x>D ( T ) - H r , d {t) - 1) + c^log^l + r"" 1 ) 
By Lemma[5j3Ia), d(G n ) > H x „,d(t) - H Rji ^ d (t), whence 

log p d(G n ) > c^-ilogpCl + r™" 1 ) + log (H x ,d(t) ~ H R,d(^) ~ 1) > t^—^t^ 1 - E, 
F F ' zlogp 

where E is a constant independent of n. 

Now take < T\ < r such that H x ,d(ti) ~ H RjD (ti) - 1 > 0. By Theorem 15.101 the 

] diverges, whence the series Yl^o c * r l a ^ so diverges. Thus if 

c = limsup^/ci, then ct% > 1, so cr > 1. Now choose any a G (1, cr). Then limsup : 
oo, whence 

log p d(G n ) > a n for infinitely many n. (* * *) 

On the other hand, the trivial upper bound c n < \X\ n implies that log p [G : G n ] = 
YJIZq < \X\ n . Thus, if we let k = log p [G : G n ] where n satisfies (***), then 

log a a loE OL 

log p r fc (G) > log, p d(G n ) > |X| n ™ > for /3 = . 

□ 
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8.3. Subgroup growth of groups of non-negative deficiency. In the proof of Theo- 
rem 18.31 we used Theorem 15.131 as a numerical inequality. The fact that it holds as inequality 
of power series also has a very interesting consequence. 

Notation. Given positive integers n,m and p, define (^) to be the coefficient of t m in 

the polynomial (1 + 1 + . . . + t p-1 ) n . Thus, Q^) 2 = (^) is the usual binomial coefficient. 

Proposition 8.4. Let G be a finitely presented pro-p group and K an open subgroup of G 
containing <&(G) = [G, G]G P , so that G/K = (Z/pZ) n for some n. Then for any integer 
< I < (p — l)n the following inequality holds: 

4K)>AG)±^ -(G) g (I) " 

i=0 W P i=0 V J P i=0 W P 

Proof. Let (X, R) be a minimal presentation of G (so that \X\ = d(G) and \R\ = r(G)). We 
shall apply Theorem 15. 131 to this presentation and the standard degree function D on Fp(X). 
Since (in the notations of § EE) G 2 ' D = [G, G]G P , we have G 2 ' D C K, so c l > D {G/K) = n and 
c l ' D (G / K) = for i > 1. Thus, by Theorem 15. 13} K has a presentation (X',R') such that 

H xl , D (t) ~ H«Mt) - 1 > d{G)t - HnMt) -l , 1 + t+ +tP - r 

1-t ~ 1-t v "' ; v ' 

Let us write H x , )D (t) = J2 d i(K)t\ H R , }D (t) = ^r^K)? and H B,D(t) = E r ^- Note that 
t\ = by Lemma l3.5f i) since the presentation (X,R) is minimal and Yli>2 ri = r (^0- 
Computing the coefficient of t l+1 on both sides of (***), we obtain 



l-l / \ 1+1/ 

I -E I 

i<i+l j</+l j=0 N_/ P i=0 v 7 P i=0 v 



£ - £ ri (J0 - 1 > d(G) ^ M - r(G) £ 

Ki+1 i</+l i=0 ^ 'P i=0 



To finish the proof it suffices to show that d(K) > ^j<; +1 di(K) — ^j<; +1 Ti{K). To prove 
the latter we take r 6 (0, 1) and apply Lemma fS.Mf b) to the weighted presentation (X, R, W) 
where W is the uniform weight function on Fp{X) with respect to X such that W{x) = r for 
all x £ X. The desired inequality follows by letting r tend to 1. □ 

The inequality in Proposition 18.41 was proved by Lackenby |La2l Theorem 1.6] for p = 2, 
and in a slightly weaker form for arbitrary p. Lackenby's proof was based on clever topological 
arguments, and it is remarkable that the finitary Golod-Shafarevich inequality (|5.4h yields 
the same result for p = 2. 

There are many different ways in which Proposition 18.41 may be used. A very important 
application, discovered by Lackenby for p = 2, deals with the case when r(G) < d(G) and 
K = $(G). 

Corollary 8.5. Let G be a finitely presented pro-p group withr{G) < d(G) and d(G) > 36p 2 . 
Then > \ ^fdjGjp^- 1 . 

Proof. Applying Proposition 18.41 with K = $(G) (so that n = d(G)) and I = [(p — l)n/2], 
we get > n • Q) p - ^ Note that E |+J ^ < = p n We claim 

that 



4 The above proof of Proposition 18. 41 was outlined by Kassabov during an informal discussion at the work- 
shop "Lie Groups, Representations and Discrete Mathematics" at IAS, Princeton in February 2006, before 
Theorem 15.131 was formally proved in EJ2 . 
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If p = 2, this is an easy consequence of Stirling formula, and for p > 2 this could be 
proved, for instance, as follows. Consider independent identically distributed random vari- 
ables X\ , . . . , X n which take on integer values 0, 1, . . . , p — 1 with equal probabilities 1 /p, and 
let S n = Xi + . . . + X n . Then (™) = p n ■ Prob(S n = I), and note that I = (p - l)n/2 is the 
expected value of S n . 



Each Xi has variance a = \/ (p — l)(2p — l)/6, so by the central limit theorem we have 

p™ ! ,(| S „- i |<i^)>^/ 1 



„ c -l/8 
-1/2 V2tt 

On the other hand, an easy induction on n shows that is increasing as a function of i 
for < i < I and (by symmetry) decreasing for I < i < 21 = (p — l)n. Hence 

e" 1 / 8 1 1 
Prob(S n = I) > -= > = > 



^{l + a^/E) 3 + 2p^/E 3p^ 
which yields (|8.ip . Hence 

3 o 



□ 



Remark: The inequality d(G) > 36p 2 can be significantly weakened using a more careful 
estimate. In particular, if p = 2, it is enough to assume that d(G) > 4, as proved in [La2j. 

Note that if G is a free pro-p group, then d($(G)) - 1 = (d(G) - l)p d ( G ), so the ratio 
d($(G))/d(G) guaranteed by Corollary 18.51 is not far from the best possible. In particular, 
it yields a very good bound on the subgroup growth of groups G for which d(U) > r(U) for 
every open subgroup U and d(U) > p 3 for some open subgroup U, and this class includes 
pro-p completions of all hyperbolic 3- manifold groups (see §[11] for details). 

9. Groups of positive power ^-deficiency 

In this short section we will briefly discuss groups of positive power p-deficiency which are 
close relatives of Golod-Shafarevich groups. These groups provide very simple counterexam- 
ples to the general Burnside problem, and it is quite amazing that they had been discovered 
just two years ago by Schlage-Puchta [SP] and in a slightly different form by Osin |Os2j . 

Definition. Let p be a fixed prime number. 

(i) Let F be a free abstract group. Given / € F, we let v p (f) be the largest non-negative 

integer such that / = h pVp for some h G F. 

(ii) If (X,R) is an abstract presentation, with |X| < oo, we define its power p- deficiency, 
denoted by def p (X,R) by 

def p (X,R) = \X\-l-J2p~ Up{r) - 

(hi) If G is an abstract group, its power p-deficiency def p (G) is defined to be the supremum 
of the set {def p (X, R)} where (X,R) runs over all presentations of G. 

The key property of power p-deficiency is the inequality in part (b) of the following theorem 
which is analogous to the corresponding inequalities for usual deficiency (Theorem 15.6( b)) 
and weighted deficiency (Theorem 15.7( b)). 

Recall that for a finitely generated abstract group G we set d p (G) = d(G/[G,G]G p ) and 
that d p (G) = d(G p ) where G p is the pro-p completion of G. 
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Theorem 9.1. Let G be a finitely generated abstract group. The following hold: 

(i) dp(G) > def p (G) + 1 

(ii) Let H be a subnormal subgroup of G of p-power index. Then 

defp(H) > def p (G)[G : H] 

and therefore 

A (m - 1 

>def p (G). (***) 



dp(H) - 1 
[G : H] 



Proof. Relations which are p-powers do not affect d p (G), so (i) follows from the fact that if 
G = (X\R), then d p (G) > |-X"| — \R\. To establish (ii), by multiplicativity of index it is enough 
to consider the case when H is a normal subgroup of index p. This case is covered by [SPl 
Theorem 2], and the proof of this result is quite short unlike Theorem 15. 7( b). □ 

An immediate consequence of Theorem 19.11 is that groups of positive p-deficiency are in- 
finite; in fact they must have infinite pro-p completion. Indeed, suppose that def p {G) > 0, 
but the pro-p completion of G is finite. Then there exists a minimal subnormal subgroup of 
p-power index, call it H; then d p {H) = which contradicts (***). On the other hand, it is 
clear that there exist torsion groups of positive power p-deficiency, so in this way one obtains 
a short elementary self-contained proof of the existence of infinite finitely generated torsion 
groups. 

As suggested by the titles of both [SPj and [Os2 , the original motivation for introducing 
groups of positive power p-deficiency was to find examples of torsion finitely generated groups 
with positive rank gradient. 

Definition. 

(i) Let G be a finitely generated abstract or pro-p group. The rank gradient of G is 
defined as RG(G) = inf ^g^p where H runs over all finite index subgroups of G. 

(ii) Let G be a finitely generated abstract group. The p-gradient (also known as mod p 
homology gradient) RG P (G) is defined as RG P {G) = inf dp ^Q^ 1 where H runs over 
all finite index subnormal subgroups of p-power index in G. 

Remark: Since subnormal subgroups of p-power index are precisely the subgroups open in 
the pro-p topology, it is easy to show that if G is an abstract group, then RG P (G) = RG(G p ), 
that is, the p-gradient of G is equal to the rank gradient of its pro-p completion. This also 
implies that RG P {G) = RG P (G') if G' is the image of G in its pro-p completion. 

Theorem 19 .lf ii) asserts that groups of positive power p-deficiency have positive p-gradient. 
Combining this result with theorems from [La3] and [AJNJ, one obtains the following corol- 
lary: 

Corollary 9.2 ([SP]). Let G be an abstract group of positive power p-deficiency. The fol- 
lowing hold: 

(a) G is non-amenable. Moreover, the image ofG in its pro-p completion is non-amenable. 

(b) If G is finitely presented, then G is large, that is, some finite index subgroup of G 
maps onto a non-abelian free group. 

Proof, (a) Note that G has a p-torsion quotient Q with def p {Q) > (this is proved in the 
same way as the analogous result for Golod-Shafarevich groups - see Theorem I6.2[) . Let Q' 
be the image of Q in its pro-p completion. We claim that 

RG{Q') > RG p (Q') = RQ P (Q) > def p (Q) > 0. 
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Indeed, RG(Q') > RG P (Q') since Q' is p-torsion, so every finite index normal subgroup of Q' 
is of p-power index, and therefore every finite index subgroup of Q' is subnormal of p-power 
index. The equality RG P (Q') = RQ P (Q) holds by the remark following the definition of 
p-gradient and RQ P (Q) > def p (Q) by Theorem 19. l( ii). 

Thus, Q' is a residually finite group with positive rank gradient and therefore cannot be 
amenable as proved in [A JN| . If G' is the image of G in its pro-p completion, then Qf is a 
quotient of G' , so G 1 is also non-amenable. 

(b) follows from a theorem of Lackenby [La4[ Theorem 1.18], which asserts that a finitely 
presented group with positive p-gradient is large. □ 

Corollary 9.3. There exist residually finite torsion non-amenable groups. 

Proof. If G is any torsion group of positive power p-deficiency, the image of G in its pro-p 
completion has the desired property by Corollary 19.21 □ 

Another construction of residually finite torsion non-amenable groups will be given in § [12J 
Recall that the first examples of torsion non-amenable groups (which were not residually 
finite) were Tarski monsters constructed by Ol'shanskii [Oil] (with their non- amenability 
proved in (Ol2] V 

We finish this section with a brief comparison of GS groups and groups of positive power 
p-deficiency. As suggested by the definitions, the latter class should be smaller than that of 
GS groups since in the definition of power p-deficiency only relators of the form f p , with k 
large, are counted with small weight, while in the definition of GS groups the set of relators 
counted with small weight also includes long commutators (in addition to relators of the form 
P , with k large). This heuristics suggests that groups of positive power p-deficiency may 
always be Golod-Shafarevich, and this turns out to be almost true: 

Theorem 9.4. [BuThj Let G be an abstract or a pro-p group of positive power p-deficiency. 
Then G has a finite index Golod-Shafarevich subgroup. Moreover, if p > 7, then G itself must 
be Golod-Shafarevich. 

While being a smaller class than GS groups, groups of positive power p-deficiency satisfy 
much stronger "largeness" properties, as we saw in this section. It would be interesting to 
find some intermediate condition on groups which is significantly weaker than having positive 
power p-deficiency, but has stronger consequences than being Golod-Shafarevich. 

10. Applications in number theory 

10.1. Class field tower problem. Let us begin with the following natural number-theoretic 
question: 

Question 10.1. Let K be a number field. Does there exist a finite extension L/K such that 
the ring of integers Ol of L is a PID? 

If K is a number field, the extent to which Ok fails to be a PID is measured by the ideal 
class group Cl(K). In particular, Ok is a PID if and only if Cl{K) is trivial. The group 
Cl(K) is always finite and by class field theory, Cl{K) is isomorphic to the Galois group 
Gal(M(K)/K) where W(K) is the maximal abelian unramified extension of K, called the 
Hilbert class field of K. 

Definition. Let K be a number field. The class field tower 

K = W°(K) C H x (iT) C M. 2 (K) C ... 
of K is defined by W(K) = EuTHP" 1 ^)) for i > 1. 
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The class field tower of K is called finite if it stabilizes at some step and infinite otherwise. 

Lemma 10.2. Let K be a number field. Then the class field tower of K is finite if and only 
if there is a finite extension L/K with Cl(L) = {1}. 

Proof. Let {Ki = HP (if)} be the class field tower of K. 

"=>" By assumption K n = K n+ \ for some n, so Cl(K n ) = {1} whence we can take L = K n . 

"•£=" Consider the tower of fields L = LKq C LK\ C Since for each i the extension 

Ki+i/Ki is abelian and unramified, the same is true for the extension LKi + \/ LKi. In partic- 
ular, LK\ is an abelian unramified extension of L. But Cl{L) = {1}, so L does not have such 
non-trivial extensions, which implies that LK\ = L. Repeating this argument inductively, 
we conclude that LKi = L for each i, so each K{ is contained in L, whence the tower {Ki} 
must be finite. □ 

Thus, Question 110.11 is equivalent to the so called class field tower problem. 

Problem (Class field tower problem). Is it true that for any number field K the class field 
tower of K is finite ? 

Computing the class field of a given number field is a rather difficult task. It is a little bit 
easier to control the p-class field, where p is a fixed prime. 

Definition. Let p be a prime and K a number field. 

(a) The p-class field of K, denoted by M p (K), is the maximal unramified Galois extension 
of K such that the Galois group Gal(M p (K) / K) is an elementary abelian p-group. 

(b) The p-class field tower of K is the ascending chain {W p {K)}i>Q defined by M. p (K) = K 
and W p {K) = W p {W p - l {K)) for % > 1. 

It is easy to see that M p (K) C HP (if) for any K and p, so if the p-class field tower of K is 
infinite for some p, then its class field tower must also be infinite. Let H^°(iT ) = Uj>oHP p (if ) 
be the union of all fields in the p-class field tower of K - this is easily shown to be the 
maximal unramified pro-p extension of K. Let G K , P = Gal{W^{K)/K). Thus, to solve the 
class field tower problem in the negative it suffices to find an example where the group Gx,p 
is infinite. The latter problem can be solved using Golod-Shafarevich inequality since quite 
a lot is known about the minimal number of generators and relators for the groups Gx,p- 

By definition of M p (K), the Frattini quotient Gk, p = Gk, p /[Gk,p, Gk,p\G p k is isomorphic 
to Gal (M p (K) / K) which, by the earlier discussion, is isomorphic to Cl(K)\p] = {x £ Cl(K) : 
px = 0}, the elementary p-subgroup of Cl(K). Let p p {K) = dimCl(K)[p\. Then 

(1(Gk, p ) = d(GK,p/[GK, P ,GK,p]G p Kp ) = p p (K). 

The following relation between the minimal number of generators and the minimal number 
of relators of Gx,p was proved by Shafarevich |Shl Theorem 6]. 

Theorem 10.3 (Shafarevich). Let K be a number field and v(K) the number of infinite 
primes of K. Then for any prime p we have 

< r(G K>p ) - d(G K:P ) < u(K) - 1. 

Combining Theorem 110.31 with Theorem I3.4f a) , we obtain the following criterion for the 
group Gk, p to be infinite: 



'We say that a Galois extension L/K is pro-p if the Galois group Gal(L/K) is pro-p. 
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Corollary 10.4 (Golod-Shafarevich). In the above notations, assume that 

(10.1) Pp {K) > 2 + 2vH*0 + l- 

Then the group Gk, p is Golod-Shafarevich and therefore infinite. 

To complete the negaitve solution to the class field tower problem it suffices to exhibit 
examples of number fields satisfying the number-theoretic inequality (jlO.ll) . As we explain 
below, for any prime p and n G N there exists a number field K = K(p, n) such that 
[K : Q] = p and p p (K) > n. Since v(K) < [K : Q] by the Dirichlet unit theorem, such K 
satisfies (jlO.ip whenever n > 2 + 2^/p~+T. 

For p = 2 one can simply take any set of n + 1 distinct odd primes q±, . . . ,q n +i and let 
K = Q(y/eqi . . . q n +i) where e = ±1 so that eq\ . . . q n +\ = 1 mod 4. If we choose = ±1 
for 1 < i < n + 1 so that Eiqi = 1 mod 4, then the extension Q(y / £i<?i, • • • , y/£n+iqn+\) / K is 
unramified (which can be seen, for instance, by directly computing its discriminant). Since 
this extension is abelian with Galois group (Z/2Z) n , we have P2{K) > n (it is not hard to 
show that in fact P2(K) = n). 

For an arbitary p, we take n + 1 distinct primes qi, . . . , q n +i congruent to 1 mod p, let 
Li = Q(CgJ be the qf 1 cyclotomic field and K{ the unique subfield of degree p over Q 
inside Lj. Let L = L x . . . L n+1 and M = Kx...K n+ x. Then Gal (L/Q) = ®Gal (Li/Q), 
so Gal (Af/Q) = ®Gal (KJQ) S (Z/pZ) n+l . Clearly, Gal (M/Q) has a subgroup of index p 
which does not contain Gal (-PQ/Q) for any i. Equivalently, there exists a subfield Q C K C M 
such that [iT : Q] = p and if is not contained in the compositum of any proper subset of 
{K\, . . . ,K n+ i}. We claim that each extension KKi/K is unramified. Indeed, KKi/K may 
only be ramified at qi since Li/Q (and hence K{/Q) is only ramified at q^. If KKi/K is 
ramified at qi, then M/K is ramified at qi, which is impossible since M/K = K Ylj^ Kj/K 
and Kj/Q is unramified at qi for j ^ i. Thus, each KKi is unramified over K, so their 
compositum M is also unramified over K. Since M/K is abelian with Gal(M/K) = (Z/pI*) n , 
we conclude that p p (K) > n (again, one can show that equality holds). 

10.2. Galois groups Gk, p ,s- The groups Gk, p which arose in the solution to the class field 
tower problem are very interesting in their own right and have been studied for almost a 
century, yet their structure remains poorly understood. In fact, it is more natural to consider 
a larger class of groups: given a number field K, a prime p and a finite set of primes S of K, 
denote by H^-^if) the maximal pro-p extension of K which is unramified outside of S, and 
let G K ^s = Gal{W£ s {K)/K) (so G KjP = G K ^). 

The structure of the group Gk, p ,s depends dramatically on whether S contains a prime 
above p or not. In the sequel we shall only discuss the so called tame case when S contains 
NO primes above p - this is equivalent to saying that p \ N{s) for any s E S where N : K — > Q 
is the norm function. In this case, without loss of generality we can actually assume that 
N(s) = 1 mod p for any s 6 S since primes s with N(s) ^ 0, 1 mod p cannot ramify in 
p-extensions. 

In this setting, Theorem 110.31 is a special case of the following result [Shi Theorems 1,6]: 

Theorem 10.5. Let K be a number field, p a prime, S a finite set of primes of K , and 
assume that N{s) = 1 mod p for every s G S. Let G = Gx,s,p- Then 

(i) d{G) > \S\ + 1 - v(K) - 5(K) 

(ii) < r{G) - d{G) < v(K) - 1 

where v{K) is the number of infinite primes of K and S(K) = 1 or depending on whether 
K contains a primitive p th root of unity or not. 
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In particular, if we fix K and p, the group Gk, p ,s can be made Golod-Shafarevich if \S\ is 
chosen to be large enough: 

Corollary 10.6. ( |NSW[ Theorem 10.10.1]) In the above notations assume that 

\S\ > 1 + v{K) + 2y/v(K) + 5{K). 

Then the group Gk, p ,s is Golod-Shafarevich. 

In general, if we fix K and p, the structure of the group Gk, p ,s becomes more transparent 
as S gets larger. In particular, by choosing 5 sufficiently large, one can ensure that certain 
number-theoretically defined group Vk,s 1S trivial, in which case one can write down a (fairly) 
explicit presentation for Gk, p ,s by generators and relations (see, e.g., [Kol[ Chapters 11,12]). 
Further very interesting results in this direction have been recently obtained by Schmidt 
[S^ISdi2llSch3l . 

Thus, in attempting to study the structure of the groups Gk, p ,s one might want to concen- 
trate on the case when S is sufficiently large, and in view of Corollary 110.61 one might hope 
to make use of Golod-Shafarevich techniques in this case. One important result of this flavor 
was obtained by Hajir |Haj| who proved that the group Gk, p ,s has exponential subgroup 
growth for sufficiently large S. In the next subsection we present a group-theoretic version 
of Hajir's argument in the simplest case K = Q. 

10.3. Examples with exponential subgroup growth. The following presentation for the 
groups Gq p s is a special case of a more general theorem of Koch (see |Koll § 11.4] and \Ko2\ 
§6]). 

Theorem 10.7. Let p > 2 be a prime and S = {q\, . . . , qd} a finite set of primes congruent 
to 1 mod p. Then the group G = Gq jPj s has a presentation 

(10.2) {x u ...,x d \x?=xf) 

for some elements {a{}f =1 in the free pro-p group on x±, . . . ,Xd (where as usual g h = h~ 1 gh 
for group elements g and h). 

Note that presentation (|10.2p can be rewritten as (xi, . . . ,xj \ [xi, dj] = xf~) and each 
qi — 1 is divisible by p. This implies that all relators in the presentation f j 1 . 2 j) of G lie in the 
Frattini subgroup, and so d(G) = d. We shall now show that any group with such presentation 
has an LRG chain (and therefore exponential subgroup growth) whenever d > 10. 

Proposition 10.8. Let G be a group given by a presentation of the form 

(xi, ...,Xd \ [xi,a,i] = xf A °) 

where ai G Fp(x\, . . . ,Xd) and Aj E Z p (note that d{G) = d and r(G) < d). If d > 10, then G 
has an LRG chain. 

Proof. Consider the group Q = Gj (x\, X2, ax, 0,2)1 the quotient of G by the normal subgroup 
generated by x\, X2, a\ and 02- We claim that Q is Golod-Shafarevich (hence infinite). Indeed, 
by construction d(Q) > d(G) — A = d — 4>6. Also note that Q has a presentation with d 
generators and d + 2 relations, namely 

Q = (xi, . . . ,Xd I x\ = X2 = ai = a 2 = 1, [xj,aj] = xf^ for i > 3), 

so r(Q) - d{Q) < 2 by Lemma [33£ii). Therefore, r(Q) - d(Q) 2 /4 < 2 + d{Q) - d(Q) 2 /4 < 
since d(Q) > 6. 

Now choose any infinite descending chain {Qi} of open subgroups of Q, and let Gi be 
the full preimage of Qi under the projection ir : G — > Q. Let F = F^(x\, . . . , Xd), let 
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N = ({[xi,Oi](a$ Ai ) -1 ,l < i < d}) F (so that G = F/N), and let F { be the full preimage of 
Gt in F. Note that G l = F { /N. Let m = [G : Gj] = [F : Fj. Then d(F t ) = (d - l)m + 1 
by the Schreier formula, and N is generated as a normal subgroup of Fi by the set R{ which 
consists of conjugates of elements of R by some transversal of Fi in F. By construction, each 
F{ contains the elements xi,X2, 0,1,0,2- Hence the relators [x{, ai](x^ Al ) _1 for i = 1,2 and all 
their conjugates lie in <3?(-Fj), the Frattini subgroup of Fi, and therefore do not affect d(Gj). 
The number of remaining relations in Ri is (d— 2)rij. Hence d{Gi) > (d— l)n* + l— (d— 2)nj = 
rij + 1 = [G : Gj] + 1, so {G{\ is an infinite chain with linear rank growth. □ 

Hajir |Haj| actually proved something more interesting than exponential subgroup growth 
for Gx,p,s f° r sufficiently large S — he showed that exponential subgroup growth can be 
achieved in the group Gk, p = Gk, p $ for a suitably chosen number field K depending on p. 
We now briefly outline how to construct such examples using the above presentations of the 
groups Gq >p>s . 

Let S and G be as in the statement of Theorem 110.71 Let H be any index p subgroup of 
G = Gq iPi 5 which does not contain any of the generators X{ (such H exists since all relators 
in the presentation (110. 2D lie in the Frattini subgroup), and let K C H£°g((Q)) be the fixed field 
of H. Then it is not hard to show that each prime from S ramifies in K. Note that W^{K), 
the maximal unramified pro-p extension of K, is contained in H^° S .(Q) by construction, and 
therefore, the Galois group Gal (H^° (K ) /Q) is a quotient of Gq^s- 

According to [Koll Theorem 12.1], the assumption that each prime from S ramifies in K 
ensures that G' = Gal(B.™(K)/Q) is isomoprhic to G/{x{, . . .,x p d ) G where xi, . . . , Xd are as 
in (fT02|) . so 

G' = (xi, . . . ,Xd | x\ = 1, [xi,ai] = 1 for 1 < i < d). 
One can show directly from this presentation that the group G' has an LRG chain whenever 
d > 12 and p > 11 or d > 65 and p is arbitrary - this is achieved by combining the idea of 
the proof of Proposition 110.81 with the notion of power p-deficiency discussed in the previous 
section. Since G K , P = Gal{W^(K)/K) is a subgroup of index p in G' = Gal(M£>(K)/Q), we 
conclude that Gx,p also has an LRG chain. 

Finally, we remark that the existence of an LRG chain in the group Gx,p has a very natural 
number-theoretic interpretation: it is equivalent to the existence of an infinite ascending chain 
of finite unramified p-extensions K C K\ C K2 C . . . such that the sequence {p p (K n )} n >i of 
the p-ranks of the ideal class groups of K n grows linearly with the degree [K n : K] . 

11. Applications in geometry and topology 

In this section we will discuss some applications of the Golod-Shafarevich theory to the 
study of hyperbolic 3-manifolds or rather their fundamental groups. By a hyperbolic 3- 
manifold we shall always mean a finite volume orientable hyperbolic 3-manifold without 
boundary. The fundamental groups of hyperbolic 3-manifolds are precisely the torsion- 
free lattices in PSL2(C) = SO(3,l), with cocompact lattices corresponding to compact 
3-manifolds. Arbitrary lattices in PSL2(C) (which are always virtually torsion-free) corre- 
spond to hyperbolic 3-orbifolds. 

If AT is a compact (orientable) 3-manifold and T = tti(X) its fundamental group, then by 
a result of Epstein |Ep| , T has a presentation (X,R) with \X\ > \R\. Thus defiT) > 0, and 
moreover the same is true if T is replaced by a finite index subgroup (since a finite cover of 
a compact 3-manifold is itself a compact 3-manifold). Note that by Theorem I3.4f b) . a group 
T with non-negative deficiency is Golod-Shafarevich with respect to a prime p whenever 
d(I»>5. 
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Lubotzky [Lu2j proved that if T is a finitely generated group which is linear in characteristic 
different from 2 or 3 and not virtually solvable, then for any prime p the set {d(Ap) : A is 
a finite index subgroup of T} is unbounded. If X is hyperbolic, then T = tt±(X) is linear 
and not virtually solvable (being a lattice in PSL2(C)). Thus, the discussion in the previous 
paragraph implies the following: 

Proposition 11.1 ( |Lulj ). Let X be a hyperbolic 3-manifold and T = tti(X). Then for every 
prime p, T has a finite index subgroup which is Golod-Shafarevich with respect to p. 

Proposition 111.11 was established in 1983 Lubotzky's paper [Lul| as a tool for solving 
a major open problem, known at the time as Serre's conjecture. The conjecture (now a 
theorem) asserts that arithmetic lattices in SL2(C) do not have the congruence subgroup 
property. The proof of this conjecture is a combination of three results: 

(a) Proposition 11 1 . ll 

(b) If r is an arithmetic group with the congruence subgroup property, then for any prime 
p, the pro-p completion Tp is p-adic analytic. 

(c) Golod-Shafarevich pro-p groups are not p-adic analytic. 

Lubotzky established part (c) using Lazard's theorem |Laz| which asserts that a pro-p 
group is p-adic analytic if and only if the coefficients of the Hilbert series Hilb^^Q^^it) grow 
polynomially, where d is the standard degree function. By Corollary 14. 3\ this cannot happen 
in a Golod-Shafarevich group. Now one can give simple alternative proofs of (c) thanks 
to many new characterizations of p-adic analytic obtained after [Lulj . For instance, a pro-p 
group is p-adic analytic if and only if the set {d(U) : U is an open subgroup of G} is bounded 
(see, e.g., [LuMn] or [DDMS, § 3,7]). This also prevents G from being Golod-Shafarevich, 
e.g., by Theorem 18.31 

Lubotzky's work provided the first (and very non-trivial) geometric application of Golod- 
Shafarevich groups and gave hope that even deeper problems about 3-manifolds could be 
tackled in the same way. Indeed, suppose one wants to prove that hyperbolic 3-manifold 
groups always have certain property (P) and (P) is inherited by finite index subgroups and 
overgroups. In view of Proposition 111.11 to prove such a result it is sufficient to show that 
every Golod-Shafarevich group has property (P). 

One of the main open problems about 3-manifolds is the virtually positive Betti number 
(VPBN) conjecture due to Thurston and Waldhausen: 

Conjecture 11.2 (VPBN Conjecture). Let M be a hyperbolic 3-manifold. Then M has 
a finite cover with positive first Betti number. Equivalently, %\{M) does not have property 
(FAb), that is, ni(M) has a finite index subgroup with infinite abelianization. 

This conjecture clearly cannot be settled just using Proposition II 1 . ll since there exist tor- 
sion Golod-Shafarevich groups. However, Golod-Shafarevich theory seemed to be a promising 
tool for attacking a weaker conjecture of Lubotzky and Sarnak. 

Conjecture 11.3 (Lubotzky-Sarnak). Let M be a hyperbolic 3-manifold. Then tt\{M) does 
not have property (r). 

For the definition and basic properties of Kazhdan's property (T) and its weaker (finitary) 
version property (r) we refer the reader to the books [BHV] and [LuZ| . 

A finitely generated group with property (r) must have (FAb), so Conjecture 111.21 would 
imply Lubotzky-Sarnak Conjecture. The latter was originally posed not because of its intrin- 
sic value, but with the hope that it may be easier to settle than VPBN conjecture, while its 
solution may shed some light on VPBN conjecture. 



GOLOD-SHAFAREVICH GROUPS: A SURVEY 



41 



It seemed quite feasible that Lubotzky-Sarnak conjecture might be solved using Golod- 
Shafarevich approach, that is, it may be true that Golod-Shafarevich groups never have 
property (r). The latter, however, turned out to be false, as explicit examples of Golod- 
Shafarevich groups with property (r) (actually, with property (T)) were constructed in [Erl| . 

These examples still leave a possibility that Lubotzky-Sarnak conjecture (or even VPBN 
conjecture) could be solved by group-theoretic methods since it is easy to identify group- 
theoretic properties which hold for hyperbolic 3-manifold groups and which clearly fail for all 
known examples of Golod-Shafarevich groups with property (t). Unfortunately, at present 
there seems to be no group-theoretic conjecture which would imply Lubotzky-Sarnak conjec- 
ture and which could be attacked with currently known methods. 

Nevertheless, Golod-Shafarevich techniques did yield new important results about 3-manifold 
groups. Perhaps the most interesting of those are two results of Lackenby dealing with sub- 
group growth. 

11.1. Subgroup growth of 3-manifold groups. In [Lal| and |La2| . Lackenby obtained 
strong lower bounds on the subgroup growth of hyperbolic 3-manifold groups. The first result 
asserts that for any hyperbolic 3-manifold group, the subgroup growth function is bounded 
below by an almost exponential function on an infinite subset of N. 

Theorem 11.4. ([La2]) Let T be the fundamental group of a hyperbolic 3-manifold, and let 
a n (T) be the number of subgroups of index n in T. Then a n (T) > 2 n /( v/logn ' log ( logn )) f or 
infinitely many n. 

This result follows by a direct (though not completely straightforward) computation from 
Corollary 18.51 and Lemma 18.11 for p = 2 (see \L&2\ § 6, Claim 2] for details) applied to the 
pro-p completion of T. (We note that by [Lu2| . the assumption d p (T) = d(T~) > 4 can always 
be achieved replacing T by a finite index subgroup). The proof of Corollary 18.51 (which is an 
algebraic result) in |La2| uses topological techniques, but the alternative proof given in this 
paper is purely algebraic and based on the finitary Golod-Shafarevich inequality. 

The second result of Lackenby asserts that for a large class of hyperbolic 3-manifolds the 
subgroup growth is at least exponential: 

Theorem 11.5 ([Lai]). Let M be a hyperbolic 3-manifold which is commensurable with an 
orbifold O with non-empty singular locus. Let p be any prime such that tti{0) has an element 
of order p. Then tti(M) has an LRG p- chain and hence has at least exponential subgroup 
growth. 

Unlike Theorem lll.4| it does not seem possible to give an entirely algebraic proof of 
Theorem 111.51 (although a substantial part of the argument in [Lal| is group-theoretic). For 
this reason we do not discuss the proof of this result in this paper and refer the reader to 
a very clear exposition in [Lal| . However, we do remark that there are many similarities 
between the proof of Theorem 111.51 and that of Proposition 110.81 (in fact, the latter was 
inspired by the former). 

Another interesting application of Golod-Shafarevich inequality to 3-manifold groups (specif- 
ically, to the structure of their rational lower central series) was obtained by Freedman, Hain 
and Teichner |FHT| . 

12. Golod-Shafarevich groups and Kazhdan's property (T) 

12.1. Golod-Shafarevich groups with property (T). In the previous section we dis- 
cussed why the question of the existence of Golod-Shafarevich groups with property (r) was 
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important in topology, or rather, why the lack of such groups would have been very useful. 
This question, however, is quite natural from a purely group-theoretic point of view as well, 
and when the question was open, one could present natural heuristic arguments for both 
non-existence and existence of such groups. On the one hand, as we already saw, every 
Golod-Shafarevich group has a lot of quotients (including finite quotients), seemingly too 
many for such a group to have property (r). On the other hand, Golod-Shafarevich groups 
behave similarly to hyperbolic groups in many ways, and there exist hyperbolic groups with 
property (T) (hence also property (t)). A posteriori, it seems that the latter heuristics 
was the "right one", at least it predicted the right answer, although the actual examples of 
Golod-Shafarevich groups with property (T) are completely different from the examples of 
hyperbolic groups with (T). 

The first examples of Golod-Shafarevich groups with property (T) were constructed in 
|Erl] as positive parts of certain Kac-Moody groups over finite fields. Property (T) for 
such groups was established earlier by Dymara and Januszkiewicz |DJ] . while the Golod- 
Shafarevich condition was verified using certain optimization of the Tits presentation of such 
groups. We shall not discuss this construction since much simpler to describe examples of 
Golod-Shafarevich groups with (T) were given in [EJlj . 

Theorem 12.1. |EJ1] Let p be a prime and d > 2 an integer, and consider the group 
G P ,d = (xi, ■ ■ ■ ,x d | x\ = 1, [xi,Xj,Xj] = 1 for 1 < % ^ j < 9). 

Then 

(i) The group G p ^ is Golod-Shafarevich with respect to p whenever p > 3 and d > 9 or 
p = 2 and d > 12. 

(ii) The group G P: d has property (T) whenever p > (d — l) 2 . 

In particular, for any p > 67, there exists a Golod-Shafarevich group ( with respect to p) with 
property (T). 

Part (i) is established by direct verification: indeed, if (X, R) is the presentation of G p ^ 
given above, then 1 — Hx{t) + Hr(t) = 1 — dr + d(d — l)r 3 + dr v , which is negative for 
t = 2/d under the required conditions on p and d. 

Part (ii) is proved using a general criterion for property (T) from |EJ1] (see Theorem 1 12. 2 1 
below). 

Definition. Let H and K be subgroups of the same group. The orthogonality constant 
orth{H, K) is defined to be the smallest e > with the following property: if V is a unitary 
representation of the group (H, K) without nonzero invariant vectors, v £ V is /f-invariant 
and w S V is ii'- invariant, then |(«,t/;}| < e[|u[| ||u;||. 

Theorem 12.2. ( [EJT| Theorem 1.2]) Suppose that a group G is generated by n finite sub- 
groups H\, . . . ,H n , and for each l<ij^j<nwe have orth(H{, Hj) < Then G has 
property (T). 

The wonderful thing about this criterion is that the orthogonality constant orth(H, K) is 
completely determined by the representation theory of the subgroup (H, K); in fact, it suffices 
to consider only irreducible representations. If G = G p d is a group from Theorem 112.11 we 
let Hi = (xi) for 1 < i < d. For any i ^ j, the group (H{,Hj) is isomorphic to the 
Heisenberg group over F p which has very simple representation theory, and one easily shows 
that orth(Hi, Hj) = l/sjp. Therefore, by Theorem ll2.21 G p ^ has (T) whenever p > [d— l) 2 . 
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Remark: The "Kac-Moody examples" with property (T) from [Erl] are quotients of the 
groups G Pt d- These groups are Golod-Shafarevich under stronger assumptions on p and d 
than the ones given in Theorem 112.11 an d it takes some work to verify the Golod-Shafarevich 
condition for these groups. 

12.2. Applications. In terms of potential applications to 3-manifolds, the existence of Golod- 
Shafarevich groups with property (T) was a "negative result" . However, it turned out to be 
a very useful tool for constructing examples of groups with exotic finiteness properties. For 
instance, it immediately implies the existence of residually finite torsion non-amenable groups. 

Theorem 12.3. |Erl| There exist residually finite torsion non- amenable groups. 

Proof. Let G be a Golod-Shafarevich group with property (T). By Theorem 16. 2^ G has a 
torsion quotient G' which is also Golod-Shafarevich. Hence the image of G' in its pro-p 
completion, call it G", is infinite. Then G" is a torsion residually finite group which has 
property (T) (being a quotient of G). Since an infinite group with (T) is non-amenable, we 
are done. □ 

Remark: Recall that another construction of residually finite torsion non-amenable groups 
due to Schlage-Puchta and Osin was described in § [9j 

Golod-Shafarevich groups with (T) also provide a very simple approach to constructing 
infinite residually finite groups which have (T) and some additional property (P) via the 
following observation. 

Observation 12.4. Let (P) be a group-theoretric property such that every Golod-Shafarevich 
group has an infinite residually finite quotient with (P) . Then there exists an infinite resid- 
ually finite group which has (P) and (T). 

Recall that several properties (P) satisfying the hypothesis of Observation 1 1 2 . 41 were stated 
in § EJ Applying Observation 112.41 to those properties, we obtain the following results: 

Proposition 12.5. ( [EJ31 Theorem 1.3]) There exists an infinite LERF group with (T). 

Proposition 12.6. [Er2] There exists a residually finite group with (T) whose FC-radical 
(the set of elements with finite conjugacy class) is not virtually abelian. 

Prop osit ion 1 1 2 . 51 answers a question of Long and Reid [LR] which arose in connection with 
the study of property LERF for 3-manifold groups while Proposition 112.61 settled a question 
of Popa and Vaes |PV| coming from measurable group theory. 

12.3. Kazhdan quotients of Golod-Shafarevich groups. In this subsection we discuss 
the proof of the following theorem: 

Theorem 12.7. ([EJ2] Theorems 1.1, 4.6]) Every generalized Golod-Shafarevich group has 
an infinite quotient with Kazhdan's property (T). 

While the fact that Golod-Shafarevich groups with (T) exist was somewhat surprising, 
once it was established, it was natural to expect that the assertion of Theorem 112.71 is true, 
and this was explicitly conjectured by Lubotzky. The conjecture was partially motivated 
by the theory of hyperbolic groups where the analogous result was known to be true: every 
(non-elementary) hyperbolic group has an infinite quotient with property (T), which follows 
directly from two deep theorems: 

(a) There exists a hyperbolic group with property (T). 

(b) Any two hyperbolic groups have a common infinite quotient. 
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In fact, this analogy suggests a naive approach to Theorem 112.71 Theorem 112.71 would follow 
from Theorem 112.11 at least for p > 67, if one could show that any two GGS groups (with 
respect to the same p) have a common infinite quotient. The latter is of course too much 
to expect, even if we consider GS groups instead of GGS groups (we do not know explicit 
counterexamples at this point, but there is little doubt that such counterexamples exist). 
Nevertheless, one could still try to show that for any GGS group G there is another GGS 
group H with (T) such that G and H have a common infinite quotient - if true, this would 
still imply Theorem 112.71 

In order to implement this approach, one needs to possess a large supply of GGS group 
with (T). The class of groups described in Theorem 112.11 is way too small for this to work, 
but using essentially the same method one can construct more groups with this property. 

Theorem 12.8. (see [EJ21 Theorem 4.2]) Let p be a prime, d > an integer and m, . . . , 
positive integers. Consider the group G given by the presentation {XkmSi Rkms) where 
Xrms = {xi,k ■ 1 < i < d, 1 < k < n d } and Rkms = { [ •^i,ki •Ej,m 

Mori / j} U 

{[cc^, Xij]} U {x p ik \. Ifd>9 and p > {d— I) 2 , then G is GGS and has property (T). 

Remark: The groups described in this theorem are called Kac-Moody-Steinberg groups in 
|EJ1| since they map onto suitable Kac-Moody groups over ¥ p as well as certain Steinberg 
groups. This explains the notations Xkms and Rkms for the sets of generators and relators. 

This class of groups is still insufficient to make the naive approach work, but a more 
convoluted scheme based on the same idea does work. We shall now outline the argument. 

First we reduce the problem to the following: 

Theorem 12.9. Every generalized Golod-Shafarevich group has a finite index subgroup 
which has an infinite quotient with Kazhdan's property (T). 

The reduction is possible due to the following general statement: 

Proposition 12.10. Let (P) be a group-theoretic property, which is preserved by quotients, 
finite direct products, finite index subgroups and finite index overgroups. Let G be a group, 
and suppose that some finite index subgroup of G has an infinite quotient with (P) . Then G 
itself has an infinite quotient with (P). 

Proposition ll2.10l was proved by Jaikin-Zapirain in the case (P) = (T) (see [EJ2j, Prop. 4.5]), 
but as observed in [BuThl Prop. 3.5], the same argument applies to any property (P) as above. 

Proof of Theorem \l2.9( sketch). We shall restrict ourselves to the case p > 67; the proof in 
the case p < 67 is similar, but more technical. Let I be a generalized Golod-Shafarevich 
abstract group; without loss of generality we can assume that T is residually-p. Let G = Tp 
be the pro-p completion of T and W a valuation on G such that defjy(G) > 0. The proof of 
Theorem 112.91 consists of four main steps. 

Step 1: Given M £ R, find an open subgroup H of G such that defyy(H) > M. 
Then we can find a weighted presentation (X, R, W) of H (where W induces W) such that 
def w (X,R) > M. 

Step 2: Given real numbers w > 1 and e > 0, show that there is a real number f(w, e) 
such that if in Step 1 we take M > f(w,e), then there is another weighted presentation 
(X', R', W) of H, with X' C Fp(X), such that W'(X') = w, W'(R') < e, W'{x) < e for all 
x £ X' and W\h) < W(h) for all h G F^(X'). 

Step 3: Show that if w and e in Step 2 are suitably chosen, then there is a group A with 
(T) from the family described in Theorem 112.81 such that 
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(i) the canonical set of generators Xkms = { x ij} °f A has the same cardinality as X' 
from Step 2, so there is a bijection (thought of as identification) a : Xkms — > X' . 

(ii) If Rkms is the canonical set of relators of A, we can choose a : Xkms X' in such 
a way that defw'(X' , R' U Rkms) > 0, so the pro-p group Q = (X' \ R' U Rkms) is 
GGS. 

Step 4-' Let A be the image of T n H in Q, and let A' be the subgroup of Q abstractly 
generated by X'. Note that A' is a quotient of A. By Lemma [6 .71 (Tails Lemma), we can find 
a GGS quotient Q' of Q in which the images of A and A' coincide; call their common image 

n. 

We claim that £1 satisfies the conclusion of Theorem 112.91 Indeed, by construction, $7 is a 
quotient of T n H (which is a finite index subgroup of T) and has (T) being a quotient of A. 
Finally, 0, is infinite being a dense subgroup of the GGS pro-p group Q'. 

We now comment briefly on the proof of each step. Step 4 has already been fully explained. 
Recall that def^(H) > defyy(G) ■ [G : H]^ for any open subgroup H by Theorem 15.7( b) 
and the H^-index [G : H]yy can be made arbitrarily large by Corollary 15.111 This justifies 
Step 1. 

The key tool in Step 2 is the notion of contraction of weight functions. 

Definition. Let F be a free pro-p group, W a weight function on F and c > 1 a real number. 
Choose a W-iree generating set X of F, and let W be the unique weight function on F with 
respect to X such that W'{x) = W{x)/c for all x G X. We will say that the function W' 
is obtained from W by the c-contraction. (It is easy to see that W does not depend on the 
choice of A"). 

In order to understand better what a contraction does we go back to weight functions on 
power series algebras. By definition, the initial weight function W is given by W(f) = w(f—\) 
where hi is a weight function on F^fi 7 ]] with respect to U = {x — 1 : x £ X}. Then the 
contracted weight function W' can be defined by W'(f) = w'(f — 1) where w' is the unique 
weight function on F p [[F]] with respect to U such that w'(u) = w{u)/c for all u £ U. Recall 
that F p [[F]] = ¥ p ((U)). It is clear that for any a £ ¥ p [[F]] such that the degree of a as a 
power series in U is at least k we have w'(a) < w(a)/c k . In particular this implies that 

(i) W'(f) < W{f)/c for all / e F; 

(ii) W'(f) < W(f)/c 2 for all / G = [F,F]F P . 

Let us now go back to the setting of Step 2. Assume first that all elements of R lie in 
$(F). If we obtain W' from W by the c-contraction for c = W(X)/w, then W'(X) = w 
and W'(R) < W(R)/c 2 by (ii). Since W(R) < W(X) and W{X) > M, we get W'(R) < 
W (R)w 2 /W (X) 2 < w 2 /M and W'(x) < w/M for all x € X. Thus in this case we can simply 
set X' = X , R' = R and f(w,e) = max{ty 2 /e, w/e}. 

In general, the situation is more complex. Note that starting with the presentation (X, R), 
we can eliminate some of the relators together with the corresponding generators (using the 
procedure described in Lemma l3.5f ii)). so that in the new presentation all relators lie in the 
Frattini subgroup; unfortunately, during this operation the weighted deficiency may increase. 
In order to resolve this problem, one needs to apply a contraction, followed by elimination of 
some of the relators, followed for the second contraction. For the details we refer the reader 
to [EJ2l Theorem 3.15]. 

Finally, we turn to Step 3. Here the precise form of relators in Rkms plays an important 
role. Let X; L = for 1 < i < 9, so that Xkms = UXj. The key property is that 

the presentation (Xkms, Rkms) is very symmetric, and therefore defv{XKMS, Rkms) > 



GOLOD-SHAFAREVICH GROUPS: A SURVEY 



46 



for many different weight functions V. A direct computation (see the proof of Theorem 4.3 
in [EJ2]) shows that def v {X KMS , R KMS ) > 1/50 whenever V(X KMS ) = £ti V(Xi) = 3/2 
and all subsets X\,... , Xq have approximately equal V- weights; more precisely, it is enough 
to assume that \V(Xi) - 1/6| < 1/100 for 1 < i < 9. 

Now take w = 3/2 and e = 1/100 in Step 2. Then we can divide the generators from X' 
into 9 subsets such that the total iy'-weight in each subset differs from 1/6 = (3/2) /9 by 
less than 1/100. Letting Xj be the i th subset, we obtain an identification a between X' and 
X KMS - Then def W '{X',R' U R KMS ) > def w >(X KMS , Rkms) ~ W'(R') > 1/50 - e > 0, so 
conditions (i) and (ii) from Step 3 are satisfied. □ 

13. Residually finite monsters 

The following famous theorem was proved by Ol'shanskii in 1980: 

Theorem 13.1 (Ol'shanskii, |Q11| ). For every sufficiently large prime p there exists an 
infinite group T in which every proper subgroup is cyclic of order p. 

Groups satisfying the above condition are called Tarski monsters, named after Alfred Tarski 
who first posed the question of their existence. Tarski monsters satisfy a number of extremely 
unusual properties. However, they are not residually finite (as they do not have any proper 
subgroups of finite index), and it is a common phenomenon in combinatorial group theory 
that residually finite finitely generated groups are much better behaved than arbitrary finitely 
generated groups. Thus it is interesting to find out how close a residually finite group can 
be to a Tarski monster. In particular, the following natural question was asked by several 
different people. 

Problem. Let p be a prime. Does there exist an infinite finitely generated residually finite 
p-torsion group in which every subgroup is either finite or of finite index? 

This problem remains completely open, except for p = 2 when non-existence of such groups 
was known since 1970s and in fact can be proved by a very elementary argument (see |EJ3[ 
§ 8.1] and references therein). However, in |EJ3I , Golod-Shafarevich techniques were used 
to prove the existence of residually finite groups which satisfy the condition in the above 
problem for all finitely generated subgroups: 

Theorem 13.2. For every prime p there exists an infinite finitely generated residually finite 
p-torsion group in which every finitely generated subgroup is either finite or of finite in- 
dex. Moreover, every (abstract) generalized Golod-Shafarevich group (with respect to p) has 
a quotient with this property. 

13.1. Sketch of the proof of Theorem 113.21 The basic idea behind constructing such 
groups is very simple. Let T be a generalized Golod-Shafarevich group. Without loss of 
generality, we can assume right away that T is p-torsion and residually-p, so we can identify 
r with a subgroup of G = Yp There are only countably many finitely generated subgroups of 

r, so we can enumerate them: Ai, A2, At the first step we construct an infinite quotient 

G\ of G such that if tti : G — > G\ is the natural projection, then 7Ti(Ai) is either finite or 
has finite index in vri(r); note that the latter condition will be preserved if we replace G\ by 
another quotient. Next we construct an infinite quotient T2 of T\ such that if 1x2 '■ G — > G2 
is the natural projection, then 7T2(A2) is either finite or of finite index in ^(T). We proceed 
in this way indefinitely. Let Goo = lim Gj ; in other words, if Gi = G/Ni (so that the chain 
{Ni} is ascending), we let = UNi, the closure of UiVj, and G^ = G/N^. Since each G. L is 
infinite, G^ must also be infinite (otherwise iVoo is of finite index in G, hence it is a finitely 
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generated prop group, which easily implies that Noo = Ni for some i). Let be the image 
of T in Goo- By construction, Too is p-torsion, and each of its finitely generated subgroups is 
finite or of finite index. Finally, r M is residually finite being a subgroup of G^ and infinite 
being dense in Goo, so it satisfies the required properties. 

So, we just have to make sure that a sequence {Gi} as above can indeed be constructed. 
Things would have been really nice if at each step we could make Gi a GGS group. We do 
not know how to achieve this, and in order to resolve the problem we have to extend the class 
of GGS groups even further. 

Definition. A pro-p group G will be called a pseudo-GGS group if there exist an open normal 
subgroup H of G and a finite valuation W on H such that 

(i) defw(H) > (so, in particular, H is a GGS group). 

(ii) The function W is G-invariant, that is, W(h 9 ) = W(h) for all h & H and g € G. 

Remark: Pseudo-GGS groups are called groups of positive virtual weighted deficiency in 
[EJ3] . 

We will need a simple lemma which generalizes Lemma 16.11 

Lemma 13.3. Let G be a pseudo-GGS group, and let H and W satisfy conditions (i) and 
(ii) above. Let S be a subset of H and let G' = G/{S) G . IfW(S) < def w {H)/[G : H], then 
G' is also a pseudo-GGS group. 

Proof. Let T be a transversal of H in G and let H' be the image of H in G' . Then H' = 
H/{S') H where S' = {s* : s £ S, t £ T}. Since W is G-invariant, W{S') < W(S)\T\ = 
W(S)[G : H], so defw(H') > by Lemma EU Thus, G' is also a pseudo-GGS group with 
H' satisfying conditions (i) and (ii) above. □ 

The following result is a key step in the proof of Theorem 113.21 

Theorem 13.4. Let G be a pseudo-GGS pro-p group, T a finitely generated dense subgroup 
of G and A a finitely generated subgroup ofT. Then there exists an epimorphism tt : G — > Q 
such that 

(i) Q is a pseudo-GGS pro-p group; 

(ii) vr(A) is either finite or has finite index in tt(T). 

Theorem 113.41 ensures that we can make each step in the above iterated algorithm, and 
therefore we have now reduced Theorem 113.21 to Theorem 113.41 



Sketch of the proof of Theorem \13.4\ Let H be an open normal subgroup of G and W a 



valuation on H from the definition of a pseudo-GGS group. By the Tails Lemma, we can 
assume that TnH is abstractly generated by X. Also, replacing A by its finite index subgroup, 
we can assume that A C H. 

Let L be the closure of A. Which of the two alternatives in the conclusion of Theorem 113.41 
will occur depends on whether the H^-index [H : L]w is infinite or finite. 

Case 1: [H : L]w < oo. In this case, by multiplicativity of P^-index (Proposition I5.8P 
and Continuity Lemma (Proposition 15.91) . for any given e > we can find an open subgroup 
U of H containing L such that [U : L]w < 1 + e. This easily implies that there exists a 
subset X e of U such that W(X £ ) < e and U is generated by L and X e . The latter condition 
implies that if we let Q = G/{X £ ) G and let tt : G — > Q be the natural projection, then 
7r(L) = 7r(U), so tt(L) must be of finite index in Q. On the other hand, by Lemma 113.31 if 
we take e < defw(H)/[G : H], then Q = tt(G) is a pseudo-GGS group, as desired. 
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Note that if G was a GGS group, then Q = tt(G) would also be a GGS group, so if Case 1 
always occurred, we would not need to consider pseudo-GGS groups at all. It is Case 2 where 
such generalization is needed. 

Case 2: [H : L]w = oo. In this case we start with an important subcase: 

Subcase: rkyy(L) < 1. In this subcase we construct the desired quotient Q exactly as in the 
proof of Theorem 13.31 Note that the assumption [H : L] w = oo is not explicitly used in the 
proof; however, it is already implied by the assumption rkw(L) < 1. 

If rkw(L) > 1, the first thing we can try is to replace W by the valuation W' obtained 
from W by c-contraction for some c > 1 (recall that c-contractions were defined in Step 2 of 
the proof of Theorem 112. 9p . More precisely, we choose a weight function W which induces 
W, let W be the c-contraction of W and then induce the valuation W' from W' . One can 
show that if W is suitably chosen, the valuation W will still be G-invariant (see [EJ3( Prop. 
4.13]). If we take c > rkw(L), then clearly rkw'(L) < 1; the problem is that the deficiency 
defw'(H) may become negative; more precisely, we can only guarantee that defw'(H) > 
if defw(H) > rk w {L). 

To overcome this problem we proceed as follows. Using the assumption [H : L]w = oo, it is 
not hard to show that for any descending chain {Ui} of open subgroups of H with fit/, = {1}, 
the quantity .^J^Lnu ) § oes t° infinity. This follows from Theorem 15.71 Continuity Lemma 
and multiplicativity of W^-index. In particular, we can find U C H which is open and normal 
in G for which rkw(L n U) < defw{U). Thus, if we let W be the valuation on U (not 
on H) obtained from W by the c-contraction, where rk\y{L D U) < c < defw(U), then 
rk\v'(L n U) < 1 and defw'(U) > 0. Now we can finish the proof as in the above subcase 
with W replaced by W and H replaced by U. □ 

14. Open questions 

In this section we pose several open problems about Golod-Shafarevich groups. All these 
questions make sense for generalized Golod-Shafarevich groups as well, but with the exception 
of Problem [U it does not seem that answering them for GGS groups would be easier or harder 
or more interesting than for GS groups. For each problem we provide brief motivation and 
discuss related works and conjectures. Our list has some overlap with the list of problems in 
a paper of Button |Bu| . 

Problem 1. Let G be a finitely presented Golod-Shafarevich abstract group. Does G contain 
a non-abelian free subgroup? 

Recall that Golod-Shafarevich pro-p groups contain non-abelian free pro-p groups, even if 
not finitely presented. In the abstract case there exist Golod-Shafarevich torsion groups, so 
an additional assumption about the group is needed to ensure the existence of a non-abelian 
free subgroup. We conjecture that the answer to Problem [T] is positive, although we are 
unaware of any promising approach to it at the moment. 

Problem 2. Let G be a Golod-Shafarevich abstract group with a balanced presentation (a 
presentation with the same number of generators and relators). Is G necessarily large? 

The main motivation for this problem comes from 3-manifold topology. Lackenby posed 
a stronger form of the virtual positive Betti number conjecture asserting that if G is the 
fundamental group of a hyperbolic 3-manifold, then G must be large. As explained in § [Til 
such G must have a finite index subgroup which is Golod-Shafarevich and has a balanced 
presentation, so a positive answer to Problem[2]would settle Lackenby's conjecture. In fact, to 
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settle the latter it is enough to answer Problem 2 in the positive under the stronger assumption 
that every finite index subgroup of G has a balanced presentation. Unfortunately, even in 
this form the problem remains wide open and there are no strong indications that the answer 
should be positive. 

Problem 3. Let G be a Golod-Shafarevich abstract group with a balanced presentation. Is it 
true that G does not have (FAb)? 

Recall that G is said to have (FAb) if every finite index subgroup of G has finite abelian- 
ization, so a positive answer to Problem [2] would, of course, imply the same for Problem [3l 
Settling Problem [3] in the affirmative would still be an amazing result - even under the extra 
hypothesis that every finite index subgroup of G has balanced presentation, it would imply 
the virtual positive Betti number conjecture. 

We remark that the analogue of Problem [3] for pro-p groups has negative answer - as 
explained in § [TOJ the Galois group Gq jPj s has a balanced (pro-p) presentation and is Golod- 
Shafarevich, provided IS"! > 5 and all primes in S are congruent to 1 mod p, but also has 
(FAb) by class field theory. Note though that finite index subgroups of the groups Gq )P) s do 
not necessarily have balanced presentations. 

Finally, note that Problem [3] (and hence also Problem [2]) would have negative answer 
if we only assumed that G is finitely presented (not assuming the existence of a balanced 
presentation). Indeed, the groups described in Theorem 112.11 are finitely presented Golod- 
Shafarevich groups which have property (T) and therefore (FAb) as well. 

Problem 4. Let G be a GGS pro-p group and W a valuation on G such that defw{G) > 0. 
Does G always have a closed subgroup H of finite W -index such that H can be mapped onto 
a non-abelian free pro-p group? 

Problem S] should be considered as a fancy pro-p analogue of Baumslag-Pride theorem, as 
we now explain. 

Baumslag-Pride theorem |BPj asserts that if G is an abstract group of deficiency at least 
two (that is, G has a presentation with two more generators than relators), then G is large. 
Several people independently asked if Baumslag-Pride theorem remains true for pro-p groups, 
that is, if a pro-p group of deficiency at least two has an open subgroup mapping onto a non- 
abelian free pro-p group. It is clear that the proof of Baumslag-Pride theorem in the abstract 
case cannot possibly be adapted to pro-p groups. The reason is that if G is an abstract group 
with def(G) > 2, the index of a finite index subgroup H of G, which is guaranteed to map 
onto a non-abelian free group, depends on the word length of relators of G, and in the pro-p 
case relators may be words of infinite length. In fact, most experts believe that the analogue 
of Baumslag-Pride theorem for pro-p groups should be false, although no counterexamples 
(or even potential counterexamples) have been constructed. 

Problem |4] is a "weighted substitute" for Baumslag-Pride theorem for pro-p groups: we 
consider a larger class of groups replacing the condition def(G) > 2 by its weighted analogue 
defw(G) > 0, but also relax the assumption on the subgroup H, only requiring finite W- 
index. 

We remark that a positive answer to Problem [4] would yield a new solution to Zelmanov's 
theorem about the existence of non-abelian free pro-p subgroups in Golod-Shafarevich pro-p 
groups. 

Problem 5. Let G be a Golod-Shafarevich pro-p group. Is G SQ-universal, that is, does 
every countably based pro-p group embed into some (continuous) quotient of G? 
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Recall that an abstract group is called SQ-universal if any finitely generated group (and 
hence any countable group) embeds into some quotient of G. If G is an abstract (resp. pro-p) 
group which maps onto a non-abelian free (resp. free pro-p) group, then G is obviously SQ- 
universal. A result of Hall and Neumann [Nej shows that in the abstract case SQ-universality 
extends to overgroups of finite index (we expect that the same is true for pro-p groups), 
and therefore abstract groups of deficiency at least two are SQ-universal by Baumslag-Pride 
theorem. 

While the validity of Baumslag-Pride theorem for pro-p groups is highly questionable, it 
is reasonable to conjecture that pro-p groups of deficiency at least two are still SQ-universal. 
It is less likely that SQ-universality holds for all Golod-Shafarevich groups, but we do not 
see any obvious indications of why this should be false. We note that the existence of torsion 
Golod-Shafarevich abstract groups means that Problem [5] would have negative answer in the 
category of abstract groups. 

Problem 6. Find a Golod-Shafarevich group of subexponential subgroup growth. 

There is almost no doubt that such groups exist. In fact, we expect that Golod-Shafarevich 
groups with property (T) described in Theorem 112.11 have subexponential subgroup growth. 
In any case, it would be interesting to compute (or at least estimate) subgroup growth for 
these groups. In the unlikely case that their subgroup growth is (at least) exponential, 
these groups would provide the first examples of Kazhdan groups with (at least) exponential 
subgroup growth. 

Problem 7. Find an interesting intermediate condition between being virtually Golod-Shafarevich 
and having positive power p-deficiency. 

This problem has already been discussed at the end of § [9j 

Problem 8. Establish new results about Golod-Shafarevich groups in characteristic zero. 

Let be the class of (abstract) groups which are Golod-Shafarevich in characteristic 
zero (see § 13.41 for the definition). Recall that every group in £1 is also Golod-Shafarevich 
with respect to p for every prime p, and it seems that all known results about groups in 
Q follow from that fact. One obvious consequence is that given a group G in fi, for ev- 
ery n G N and every prime p there exists a finite index subgroup H = H(n,p) of G s.t. 
d p {H) = d(H/[H,H]H p ) > n. It is natural to ask whether one can find such H(n,p) which 
is independent of p. Equivalently, is it true that for every n £ N, there exists a subgroup 
H = H{n) of G s.t. d(H ab ) = d(H/[H, H]) > n; in other words, does G have infinite virtual 
first Betti number? 

The latter question is particularly interesting for free-by-cyclic groups F x Z (with F free 
non-abelian). As mentioned at the end of § 13.41 a group G of this form is GS in characteristic 
zero whenever its first Betti number is at least two (this is equivalent to saying that G maps 
onto Z 2 ). 

Problem 9. Find a "direct" proof of non- amenability of Golod-Shafarevich groups. 

Recall that in [EJ2] . non-amenability of GS groups follows from the fact that they possess 
infinite quotients with property (T) which, in turn, depends on the existence of a very concrete 
family of groups with property (T) (described in Theorem I12.8[) which happen to be GGS 
with respect to many different weight functions. While the fact that an infinite group with 
property (T) is non-amenable is not a deep one, it does not seem that the groups from 
Theorem 112.81 provide the "real reason" for non-amenability of GS groups. 
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Finding a proof of non-amenability of GS groups which does not use property (T) is also 
of interest because it may shed some light on the following question of Vershik |Ve| which is 
still open. 

Question 14.1. Let G be a finitely generated group, let p be a prime, let M be the aug- 
mentation ideal of the group algebra ¥ P [G], and assume that the graded algebra gr¥ p [G] = 
©5£L M n /M n+1 has exponential growth. Does it follow that G is non-amenable? 

Corollary 14.31 implies that GS groups satisfy the above hypothesis, so a positive answer to 
Question 114. II would provide a new proof of non-amenability of GS groups. 

Our last problem deals with the Galois groups Gk, p ,s defined in § [TUJ As explained in 
§ [10j3, many such groups have an LRG (linear rank growth) chain, and it is natural to ask 
whether every chain is an LRG chain in those groups. Assuming that K,p and S are such 
that Gk, p ,s is infinite, the following conditions are easily seen to be equivalent: 

(a) Gk, p ,s has positive rank gradient. 

(b) Any (strictly) descending chain of open normal subgroups of Gk, p ,s is an LRG chain. 

(c) Let K = Kq C K\ C ... be a (strictly) ascending chain of finite Galois p-extensions 
of K unramified outside of S. Then the sequence {p p (K n )} of p-ranks of the ideal 
class groups of K n grows linearly in [K n : K\. 

Problem 10. Assume that the group Gk, p ,s is infinite and hypotheses of Theorem \10.5\ hold. 
Determine whether the equivalent conditions (a),(b) and (c) above hold or fail (depending on 
the triple (K,p,S)). 

We are not aware of a single example where the answer to this question is known. We 
conjecture that conditions (a),(b),(c) always fail, that is, the group Gk, p ,s always has zero 
rank gradient (under the above restrictions). This conjecture is based on the various known 
analogies between the groups Gk, p ,s and hyperbolic 3-manifold groups (see, e.g., [Rez] and 
|Mo| ). In particular, similarly to the groups Gk, p ,s, many hyperbolic 3-manifold groups have 
LRG p-chains by Theorem 111.51 At the same time, very deep recent work of Wise on quasi- 
convex hierarchies combined with a theorem of Lackenby |La4} Theorem 1.18] implies that 
for every hyperbolic 3-manifold group G and every prime p, the p-gradient of G (equal to the 
rank gradient of Gp) is zero. 
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